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Abstract 

The index coding problem is studied from an interference alignment perspective, providing new re- 
sults as well as new insights into, and generalizations of, previously known results. An equivalence is 
established between multiple unicast index coding where each message is desired by exactly one receiver, 
and multiple groupcast index coding where a message can be desired by multiple receivers, which settles 
the heretofore open question of insufficiency of linear codes for the multiple unicast index coding prob- 
lem by equivalence with multiple groupcast settings where this question has previously been answered. 
Necessary and sufficient conditions for the achievability of rate half per message are shown to be a natural 
consequence of interference alignment constraints, and generalizations to feasibility of rate per mes- 
sage when each destination desires at least L messages, are similarly obtained. Finally, capacity optimal 
solutions are presented to a series of symmetric index coding problems inspired by the local connectivity 
and local interference characteristics of wireless networks. The solutions are based on vector linear coding. 
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1 Introduction 



Much progress in network information theory can be attributed to the pursuit of the capacity of simple- 
to-describe canonical network communication models. Simplicity in the network communication models 
often affords a clear formulation of techniques involved in the communication system. The focus of this 
paper is the index coding problem which is arguably the simplest multiuser capacity problem because it is a 
communication network that has only one link with finite capacity. Yet, this turns out to be the proverbial case 
where appearances can be quite deceiving. More than a decade after it was introduced by Birk and Kol in 
Q~l|2l/ * ne m dex coding problem not only remains open, but also has been shown to include as special cases 
a number of difficult problems in both wired and wireless settings — such as the general multiple unicast 
problem with linear network coding [3J, multi-way relay networks [4J, and the blind cellular interference 
alignment problem in wireless networks f5j, to name a few. Remarkably, the index coding problem is also 
the origin of the fundamental idea of interference alignment |2 1, which was re-discovered, extensively stud- 
ied and developed in a variety of forms in wireless networks 1 6 ,7 , 8 1 and has recently found applications in 
network coding problems such as the distributed data storage exact repair problem |9 . 10 1 and the 3 unicast 
problem liTTllTZl . In this paper, we attempt to bring this idea "home", by applying the understanding of the 
principles of interference alignment, into the original setting — the index coding problem. 

The essence of the index coding problem lies in its focus on a single bottleneck network. Having only 
one link with finite capacity concentrates the challenge of network coding in one place, highlighting some of 
the most fundamental, challenging, and surprising aspects of the network coding problem. Understanding 
the role of a single bottleneck edge in a network when the rest of the network is composed only of trivial 
links (of infinite capacity), is a natural stepping stone toward a broader understanding of communication 
networks^] We start with a discussion of similarly motivated single-bottleneck settings for both wired and 
wireless networks. 

1.1 Single Bottleneck Wired Networks - Index Coding 

Consider a general network coding setting shown in Figure [TJa) where the source nodes on the left com- 
municate with the destination nodes on the right through a network of intermediate nodes connected via 
orthogonal, noiseless, capacitated links. If only one link (shown in black) in the intermediate network has 
finite (unit) capacity, and all the other links have infinite capacity, then the remaining problem is the index 
coding problem. Clearly, the only non-trivial message flows are those for which every path between the 
source and the desired destination(s) must pass through the finite capacity link. All other messages have 
either rate zero or infinity, and can be eliminated. The remaining network contains three kinds of infinite 
capacity links in addition to the finite capacity link. First, the remaining source nodes must connect to the 
transmitter of the finite capacity link via infinite capacity links. Second, the destination nodes must connect 
to the receiver of the finite capacity link through infinite capacity links. Third, if there are infinite capacity 
paths between any sources and their non-desired destinations, those paths are replaced with infinite ca- 
pacity links known as "antidote" links. An example is shown in Figure [TJb) where 5 messages originate 
at the sources on the left and are desired by the destination nodes on the right. The bottleneck link (finite 
capacity link) is shown at the top of the figure in black and carries a sequence of symbols from a finite al- 
phabet S n E S n , which are chosen with full knowledge of all messages. The antidote links are shown in red. 
Clearly, the best use of the antidote links, which have infinite capacity, is to convey all the information, i.e., 
the messages, from the transmitters to the receivers of the antidote links. These antidotes comprise the side 
information that makes the problem interesting and quite challenging in general. Each destination must be 
able to decode its desired message based on the sequence of symbols sent over the bottleneck link and the 
set of undesired messages available to it as antidotes. 

Index coding can also be seen as "source coding with side information", or as a broadcast channel with 
cognitive receivers, i.e., where certain receivers have full knowledge of certain messages a-priori. 

1 The edge-removal problem introduced in I13II14I is another intriguing open problem that seeks to understand the role of a single 
edge in a network, and has been shown to be closely related to the general question of whether zero-error capacity and e-error capacity 
are the same for general network coding instancesl 15 [. 
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Figure 1 : (a) General network coding problem: If only one link (shown in black) in the intermediate network has finite 
(unit) capacity, and all the other links have infinite capacity, then the remaining problem is the index coding problem, 
(b) Example of an index coding setting. 

1.2 Single Bottleneck Wireless Networks - Wireless Index Coding 

Consider a wireless network shown in Figure |2ja) comprised of the source nodes shown on the left, which 
communicate with destination nodes shown on the right, through an intermediate network of relay nodes. 
Depending on propagation path loss different pairs of nodes may be connected or disconnected. Because 
this is a wireless setting, signals emerging from the same transmitter are broadcast, and signals arriving at 
the same receiver interfere. All transmitters are subject to power constraint P, and generally the receivers 
experience additive white Gaussian noise (AWGN) in addition to the superposition of fading signals from 
connected transmitters. As an analogue to the index coding problem defined by a single bottleneck link, let 
us assume only one of the receivers in the intermediate network experiences AWGN, e.g., of unit variance, 
while all other receivers experience no noise, i.e., have infinite resolution of the complex valued signals, 
essentially providing them infinite capacity links to their respective connected transmitters. Eliminating 
messages that have infinite capacity paths between their sources and all their desired destinations, what 
remains is the wireless index coding problem, introduced in [5|. While, depending on the wireless network 
topology, the resulting wireless index coding problem can in general be quite involved, e.g., if the wireless 
network contains cycles that provide feedback from the output of the bottleneck receiver to the distributed 
or partially cooperating nodes transmitting to the bottleneck receiver, Fig. |2jb) shows a relatively simple 
example of the wireless index coding problem that corresponds to the index coding problem of Fig. [lpb), 
in the sense that the capacity of the index coding problem maps directly to the degrees of freedom (DoF) of 
the wireless index coding problem. The capacity per message of the index coding problem in FigureJTJb) is 
2/5, as is the DoF value per message for the wireless index coding problem in Fig. |2|b), and in both cases 
the "unit" for measurement is the capacity/DoF of the bottleneck link/ receiver. The index coding problem 
normalizes the bottleneck link capacity to unity, so that all rates are measured as multiples of the bottleneck 
link capacity, and the wireless index coding problem normalizes the number of signal dimensions (DoF) 
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Figure 2: (a) Wireless network: If only one receiver (shown with incoming signals in black) in the intermediate network 
has non-zero (unit) AWGN variance, and all the other receivers have zero noise (infinite capacity), then the remaining 
problem is the wireless index coding problem, (b) Example of a wireless index coding setting. 

available to the bottleneck receiver to unity, and all DoF are measured as multiples of the bottleneck DoF. 
As explained in [5J, the relationship between the index coding problem and the wireless index coding prob- 
lem goes much further, and much more can be said about their similarities and differences. For instance, if 
full cooperation is allowed between all sources directly transmitting to the bottleneck receiver in a wireless 
index coding problem, the DoF of the resulting network is the same as the capacity of the corresponding 
index coding problem (in their respective units). The DoF of the wireless index coding problem are, in 
general, bounded above by the capacity of the index coding problem. It also highlights the main difference 
between the index coding problem and the wireless index coding problem — all sources are necessarily 
allowed to fully cooperate in the former because the bottleneck transmitter has full knowledge of all mes- 
sages, but not necessarily in the latter (depending on the topology of the original network in Figure |2ja)). 
However, if the index coding problem has a capacity optimal vector linear coding solution that can be trans- 
lated to the complex field, then the same solution may be applied in the wireless index coding problem as 
well. This is because vector linear solutions are comprised of a superposition of separately encoded mes- 
sages, and a superposition over complex field is naturally provided by the wireless medium [5J. Somewhat 
surprisingly, this is a very common situation, e.g., all the instances of the index coding problems studied in 
this paper have capacity optimal vector linear coding solutions that translate to the complex field, thereby 
simultaneously providing the DoF characterization for the corresponding wireless index coding problem. 

In the wireless index coding problem discussed above, the bottleneck is concentrated at one receiver, 
lending the bottleneck a multiple access character. Another formulation of the wireless index coding prob- 
lem is also conceivable where the bottleneck may be concentrated at one transmitter, e.g., all receivers 
experience additive noise and there is only one transmitter with finite power (all other transmitters have 
infinite power), which would lend the bottleneck a broadcast character, and which could be a similarly 
interesting and promising research avenue. 



The motivation for studying single bottleneck networks in both wired and wireless settings is evident 
from an information-theoretical perspective as a stepping stone to a broader understanding of communi- 
cation networks. What is surprising is that the index coding problem, in spite of its simple formulation, 
not only already captures much of the complexity of the full-fledged network capacity problem, but also 
contains a class of problems known as cellular blind interference alignment [5] problems (CBIA) which are 
of immediate practical interest for cellular wireless networks. Indeed, it is the CBIA setting that motivates 
most of the instances of index coding that we solve in this work. 



1.3 Cellular Blind Interference Alignment Problem 

Consider, as an example, the cellular downlink setting shown in Figure |3|a) comprised of 5 partially over- 
lapping cells depicted as circles, with each circle containing a transmitter (base station) near its center, 
shown as a black square, and 3 receivers (users), shown as white squares. Propagation path loss is modeled 
by the assumption that each base station transmitter (black square) is only heard within the circular region 
defining its own cell. This gives rise to the connectivity pattern where each transmitter can be heard by 
three receivers and each receiver can hear three transmitters. The resulting wireless network connectivity 
is shown in Figure[3jb) where the links show the non-zero channel coefficients. The knowledge of non-zero 
channel coefficient values, which are assumed to be drawn from identical distributions, is not available to 
the (blind) transmitters. Depending upon the message sets, e.g., whether each transmitter sends a mes- 
sage to only one user in its cell, or whether each transmitter sends 3 independent messages to the 3 users 
in its cell, we have the partially connected interference channel or X channel setting, respectively. Since 
both settings will benefit significantly from interference alignment and no knowledge of non-zero channel 
coefficient values is assumed, this is known as the cellular blind interference alignment problem [5J. 
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Figure 3: Cellular blind interference alignment problem: (a) Cellular layout governing the connectivity pattern, (b) 
Locally connected network representation, (c) Corresponding index coding problem 

The key to the CBIA problem lies in its close relationship to a corresponding index coding problem. 
For the CBIA problem of Figure |3|a) and Figure |3]b>), the corresponding index coding problem is shown in 
Figure|3|c). In the corresponding index coding problem, the graph connecting black (transmitter) and white 
(receiver) nodes in Figure 3jc) is complementary to that in Figure |3|b), i.e., a black and white node pair is 
connected in the index coding problem of Figure[3jc) by an antidote link if and only if it is not connected in 
the locally connected network representation of Figure|3jb). Thus, antidote links which allow a receiver to 
subtract from its received signal the contribution from the corresponding undesired messages, especially 
in a superposition based coding scheme as must be the case in the CBIA problem due to the distributed 
nature of the source nodes and the additive nature of the wireless medium, play the same role as a zero 



channel coefficient value in the local connectivity pattern. The main difference in the index coding problem 
is that full cooperation between sources is allowed, which makes the index coding capacity an outer bound 
on the DoF of the CBIA setting. However, since the optimal solution to the index coding problem is often 
based on vector linear coding and robust to the choice of the underlying field as the real or complex field, 
as we will see, e.g., for the problem in Fig. |3]as well as for several other index coding problems, the optimal 
solution of the index coding problem will automatically provide the optimal solution to the CBIA problem 
as well. 

Finally, we note that the CBIA problem has a natural counterpart in the network coding setting, which 
is a blind linear network coding problem. Consider, for example, a wired network where the intermediate 
nodes perform random linear network coding, creating a linear channel matrix comprised of polynomials 
in the network coding coefficients of the intermediate nodes. Recent work IITTl Il6l IT2l 117 ] has investigated 
how to exploit the knowledge of these effective channel matrices at the source nodes to achieve interference 
alignment by linear precoding, in a manner that mimics the wireless interference channel. However, sup- 
pose that the source nodes only know the end-to-end connectivity but do not know the channel coefficient 
values, e.g., because they do not keep track of all the network coding coefficients. Aside from the significant 
distinction of working over finite fields, the resulting blind network coding problem is virtually identical to 
the CBIA problem, and is similarly related to a corresponding index coding problem. For instance, if we 
take the graph of Figure|3|b) to represent the resulting connectivity of a blind network coding problem, with 
no knowledge of the channel realizations at the transmitters, then the corresponding index coding problem 
is shown in Figure|3|c). Solving this index coding problem will solve both the corresponding CBIA problem 
as well as the corresponding blind network coding problem. 

The preceding discussion of closely related problems sheds light on the significance of the index coding 
problem. In spite of having only one finite capacity link, the richness of the index coding problem is evident, 
and its complexity is further underscored in the result by Rouayheb et al. in Q where an equivalence is 
established between the general network coding problem restricted to linear codes, and the index coding 
problem. On the other hand, the relative simplicity of the index coding problem setting does make this 
setting more tractable. As an example, we note that the equivalence of e-error capacity and zero-error 
capacity, which remains open in the general network coding problem, has been established for the index 
coding problem |15l. 

2 Index Coding - Problem Formulation 

The index coding problem consists of a set of M independent messages 

W = {W ll W 2 ,...,W M }, 

and a set of K destination nodes 

V = {Di,D 2 ,--- ,D K }, 
with the k th destination node D k identified as 

D k = (W k ,A k ) 

where W k C W is the set of messages desired by D k , the set A k C W is comprised of the messages available 
to destination D k as side information (antidotes), and W k H A k = <f>, i.e., a destination node does not desire 
a message that is already available to it. 

An (S, n, TV) index coding scheme corresponds to the choice of a finite alphabet S of cardinality \S\ > 1, 
a coding function, /, and a decoding function g k i, for each desired message Wi at each destination D k . The 
coding function / maps all the messages to the sequence of transmitted symbols 

/(Wi.Wa,— ,W M ) = S n 

where S n € S n is the sequence of symbols transmitted over n channel uses. Here, Vm G {1,2, •• • , M}, 
message W m is a random variable uniformly distributed over the set 

W m €{l,2,--,\S\ nR ™}, 



and 1Z is simply the rate vector 

7^ = (R17R2, ■ ■ ■ ,Rm)- 
At each destination, Dk, there is a decoding function for each desired message 

g Kl (S n ,A k ) = W Kl , Vt such that W l G W k . 
The decoding is said to be in error if any desired message is decoded incorrectly. The probability of error is 

P e = 1 - Prob[W fejl = Wi, Vi, fc such that Wj G W k }. 

A rate tuple 1Z = ,R 2 ,..., i?M ) is said to be achievable if for every e, (5 > there exists a (5, n, (R\ , R2, ■ ■■ , Rm)) 
coding scheme, for some S,n, such that Vra G {1,2, . . . ,M}, R m > R m — 5, and the probability of error 
Pe _ £■ The capacity region of the index coding problem is defined as the set of all achievable rate tuples 
(R 1 ,R 2 , Rm) and is denoted by C. 

As an example of the notation, in Figure [TJb), we have M = K = 5, W = {W\,Wz, ■ ■ ■ ,W 5 }, D ± = 
({Wi}, {W 5 , W 2 }), D 2 = {{W 2 }, {Wi, Wi}), D 3 = {{W 3 }, {W 2 , Wi}), D 4 = ({Wi}, {W 3 , W 5 }), D 5 = {{W 5 }, {Wi, Wi}). 

The definition of capacity region presented above is in the classical sense of asymptotically vanishing 
probability of error, also known as e-error capacity, which can in general be larger than the zero-error 
capacity where only achievable schemes with P e = are allowed. Even for networks comprised of noise- 
less links, e.g., in the network coding problem, the equivalence of the two is not known. Remarkably, for 
index coding, it has been shown by Langberg and Effros in lfT5ll that the e-error capacity is the same as the 
zero-error capacity. 

It is noteworthy that the choice of the alphabet, S, is inconsequential for the capacity region, e.g., one 
could restrict S = {0, 1} without affecting the capacity as defined above, or interpret multiple channel uses 
as a single channel use over a larger alphabet, again without impacting capacity. Note, in particular, that the 
rates are measured in base-5 units, and the capacity of the bottleneck link is automatically normalized to 
one unit. The choice of alphabet is important to distinguish between linear and non-linear coding schemes, 
or scalar and vector coding, as will be explained later in this section. 

We refer to the general index coding problem statement presented above as the multiple groupcast 
setting, where each message may be desired by multiple destination nodes. This general term includes 
within its scope both the multiple unicast setting where each message is desired by exactly one destination 
node, and the multiple multicast setting where each message is desired by all destination nodes. 

Notation: For any subset of messages W C VV we define the compact notation, 

W c = w-W 

as the set of messages in W that are not in W. Further, we use the compact notation Wij — {Wi,Wj}, Wi^k = 
{Wi, Wj,Wk}, etc. Also we define the compact notation, R il . L = R n + R l2 + . . . + Ri L ,JC = {1,2,..., K} and 
M {1.2 \1\ 

A few important classes of the index coding problem are formalized next. 
Definition 1 — Multiple Unicast Index Coding 

The index coding problem is called a multiple unicast index coding problem if and only if 

Vfci, k 2 G AC, fci ^ k 2 , W kl n W k2 = 4> (1) 

In other words, a multiple unicast index coding problem is one where no message is desired by more than 
one destination. 

Definition 2 — Scalar Index Coding Scheme 

An (S, n, TZ) index coding scheme is called a scalar coding scheme if and only if 

n = i'.'.-. ; ) ,2) 

\n n n J 



In other words, a scalar index coding scheme sends one symbol for each message over n channel uses. 



Definition 3 — Linear Index Coding Scheme 

A linear (S,n,TZ) index coding scheme, achieving the rate vector 1Z= ( ^ , ^ , • • • , ^ ) over n channel uses, corre- 
sponds to a choice of 

1. a finite field F = S as the alphabet 

2. V m £ F™ xLm , Vm £ M as precoding matrices 

3. X5 m .k £ F LmX ", Vm £ M,Vk such that W m £ W k , as receiver combining matrices 
such that the following properties are satisfied 

Property 1: U mifc Vj = 0, Vm, i £ M,k £ JC such that m^i, W m £ W fe , W t £ Ak 
Property 2: det (U m> feV m ) ^ 0, Vm £ M,k £ JC such that W m £ Wk 

where all operations are over F. 

The transmitted symbol sequence S n £ F" xl in a linear index coding scheme is 

M 
m— 1 

where X, = {x m ,i,x m , 2 , • • • -,x m .L m ) T £ F LmXl is an L m x 1 vector representing W m . In other words, message 
W m is split into L rn independent scalar streams, each of which carries one symbol from F, and is transmitted 
along the corresponding column vectors (the "beamforming" vectors) of the precoding matrix for V TO . The 
decoding operation for message W m £ Wk, desired at destination D k , is 

V WiGA k ) 

Thus, first the contribution from undesired messages available as antidotes, Wi £ Ak, is eliminated from 
S n , then the remaining undesired symbols are zero-forced by Property 1, and finally the desired symbols 
X m are recovered by the invertibility of U TOi feV TO , which is guaranteed by Property 2. 

Note that a linear encoding scheme, as explained above, is a zero-error encoding scheme. The linear 
coding scheme described above is also called vector linear coding schemes. This includes the special case 
where L t = 1, Vi £ {1, 2, • • • , M}, which is called a scalar linear encoding scheme. 



3 Index Coding as an Interference Alignment Problem 

In wireless communications, interference is a natural phenomenon because of the broadcast nature of the 
medium. In wireline network communications, interference occurs because of multiple data streams con- 
tending for a common link. Indeed this scenario is best exemplified by the index coding problem where a 
single bottleneck link is shared by all data streams. In wireless systems, interference alignment provides 
surprising gains by exploiting the inherent diversity in distributed linear systems - i.e., the notion that ev- 
ery receiver sees a different alignment of signal dimensions, and therefore signals can be designed to align 
at one receiver and stay separable at a different receiver. In the index coding problem too, we exploit the 
inherent diversity that exists among the receivers because the set of antidotes is different for different re- 
ceivers. This diversity ensures that even if two signals align along the botteleneck link, they can be decoded 
at the desired receivers if they have the appropriate set of antidotes. This alignment frees up the available 
dimensions (on the botteleneck link) for other messages and hence makes the system more efficient. 



To better understand the role of interference alignment in the index coding problem, let us examine 
a linear index coding achievable scheme. For simplicity of exposition, let us consider a symmetric rate 
setting, i.e., L m = L, Vm € M. For any subset of the set of messages B c W, we use the following notation. 

Vg = {v e F" : v is a column vector of Vj, i e 

i.e., is the set of all column vectors which belong to at least one matrix Vj : i £ B. 

Receiver k receives a n dimensional vector S n which is a linear combination of the M L column vectors 
of Vyy • If all the M L column vectors are linearly independent, then, clearly the desired signal is resolvable 
and a symmetric rate of 1 /M per message is achievable. Note that this is the rate achieved by routing; the 
routing solution is in fact one such realization of Vyy ■ In general, however, the messages may be resolvable 
even if the M L column vectors of Vyy are linearly dependent, because of the presence of antidotes. In fact, a 
rate greater than 1 /M is possible only by making the column vectors of Vyy linearly dependent. Consider, for 
instance a receiver, say destination Dk that wants to decode messages Wk and has antidotes for messages 
Ak- It receives the linear combination of ML vectors in an n dimensional space, of which it can cancel 
the impact of |^4fe|I/ vectors, Vj^ k , using the antidotes. Therefore, from the perspective of destination Dk, it 
observes (after cancellation), the linear combination of |Wfc|£ desired vectors along Vw k , and [M — \Ak\ — 
\Wk\)L interfering vectors along the columns of Vw-(A k uw k )- A necessary condition for the resolvability of 
messages Wk at destination Dk can be expressed as 

span(V w J n span(V W -(^ fc uw fc )) = {°l ( 5 ) 

This means that a necessary condition for resolvability at destination Dk, is that the dimension of interfer- 
ence span(Vyy_(.A ( .uw fc )) should be smaller than n — |Wfc|£ (because the vectors are all observed in an n 
dimensional space). Clearly, if n — \ Wk\L < |W — (Ak U Wk)\L, then, the interfering vectors need to align in 
an n — \ Wk\L dimensional space. 

Next we illustrate the role of interference alignment in index coding, with a series of examples presented 
in increasing order of complexity. 

3.1 Example 1: Scalar linear index coding with One-to-One Alignment 



S = xiVi + {x 2 + x 3 )V 2 




Figure 4: A simple example where interference alignment is useful in the index coding problem. In the example, the 
alignment is realized with V2 = V3 . The alignment enables transmission of 3 scalars to the 3 corresponding users in a 
2 dimensional vector space, ensuring that a rate of 1/2 is achievable. The field F can be chosen arbitrarily. 

A simple interference alignment solution for an index coding problem is demonstrated in Figure |4j 
where K = 3 and each user sends L = 1 vector. Because of alignment of V 2 and V 3 , user 1 is able to 



resolve X\. For instance one may choose Vi = [0, 1] T , V 2 = V 3 = [1, 0] T , so that the two transmitted symbols 
on the bottleneck link are S 2 = (Si, S2) = (^2 + £3, x\), from which each destination is able to recover its 
desired message. Since only one symbol is sent per message, this is an example of a scalar linear index 
coding solution. Furthermore, since the alignment of vectors takes place in a one-to-one fashion, i.e., V2 
aligns with V3, we refer to this as a one-to-one alignment solution, to be distinguished from the subspace 
alignment solutions to be presented soon. 

3.2 Example 2: Vector linear index coding with One-to-One Alignment 

The next index coding example comes from the CBIA setting shown in Figure|3|a), (b), (c). Depending upon 
whether each base station transmitter has a message only for one corresponding receiver, or an independent 
message for each of the receivers that are within-range, we have the interference channel or the X channel 
setting, respectively. Here we consider the interference channel setting. 




(a) (b) (c) 

Figure 5: Index coding problem corresponding to a CBIA interference channel setting, (a) Desired message and anti- 
dote sets, (b) Capacity optimal solution shows the 5 dimensional S 5 space, along which 2 symbols per message are sent 
with one-to-one pairwise alignments along the 5 orthogonal basis vectors. The field F can be chosen arbitrarily. 

The interference channel setting means that base station (source) i, 1 < i < 5, has only one message, Wi, 
for its corresponding receiver (destination) i. So there are M = 5 distinct messages and K = 5 destinations. 
The goal is to achieve a symmetric rate of | per message, which is also the capacity optimal solution. The 
problem is essentially identical to the setting considered in |18| to show that vector linear index coding 
outperforms scalar linear index coding. The achievable scheme is a vector linear scheme operating over 5 
channel uses. The precoding vectors V;, 1 < i < 5 are 5x2 matrices and is a 2 x 1 vector X^ = i,x; 2] T 
representing W^. Therefore, the transmitted symbol sequence S 5 is 

5 

S 5 = J2Vi*i (6) 

i=l 

To see why interference alignment is necessary, note that each destination has access to the 5-dimensional 
symbol S 5 , and it also observes 2 messages as cognitive information. After removing the known streams 
from S 5 the receiver is left with 6 remaining unknown symbols in a 5-dimensional space. Since each destina- 
tion is interested in two desired streams, these desired streams span 2-dimensional space and the remaining 
4 streams that constitute interference, must align in way that they occupy at most a 3-dimensional space. 

Suppose Ti,T2,T3,T4,T5 are 5 linearly independent vectors over the 5-dimensional space. These 5 
vectors can be chosen to be the columns of 5 x 5 identity matrix. At destination 1, messages W2 and W5, each 



composed of two independent scalar streams, should align such that they together occupy a 3 dimensional 
space. One way to do so is to perfectly align the precoding vector of one of two streams of W2 with one of 
two streams of W5, e.g., V2.2 = V5.1 = TV Similarly, in order to satisfy the alignment constraint at all the 
destinations, the precoding vectors are chosen to be 
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T 3 , Vi ; 



■■ Ta, V 4 



(7) 



Thus, the 4 undesired signal vectors are aligned at each destination such that they occupy only 3 di- 
mensions. Now what we need to prove is the resolvability of desired messages at the corresponding desti- 
nations. They are resolvable because at each destination i the desired messages are received in a space T>i 
which is linearly independent from the space of interfering messages called 2, . 
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(8) 



Finally, if we choose the receiver combining matrices as follows, we can easily verify that both property 1 
and property 2 are satisfied 



At destination 1: Ui 1 



At destination 3: U 3 



At destination 5: U 



5,5 



10 

1 

10 

10 

1 
10 



At destination 2: U 2 



At destination 4: U4 



1 

1 

1 

1 



(9) 



Note that the field can be chosen arbitrarily, i.e., any choice of F works for the linear solution presented 
above. The solution also translates to the real or complex fields, thereby establishing the DoF for the corre- 
sponding CBIA problem as 2 /5 per message. 



3.3 Example 3: Scalar linear index coding with Subspace Alignment 

Like the previous example, this example also corresponds to the CBIA problem in Fig. |3ja),(b),(c). The 
difference is in the message sets. While in the previous example, each base station served only one receiver, 
here we assume that each base station has 3 independent messages, one for each of the 3 receivers that are 
within receiving range of the base station. In the parlance of wireless networks, while the previous setting 
is an interference network, the current setting is an X network. In an X network, there is an independent 
message to be communicated between each transmitter-receiver pair that are within range of each other, 
i.e., have a non-zero channel coefficient between them. 



The index coding problem for the X network setting is also well defined. There is an independent mes- 
sage between each source and destination pair that are not connected via an antidote link. The specific 
index coding problem that we solve in this example is shown in Figure [6] Notice that an X network setting 
is a mulitple unicast setting, since each message has a unique source and a unique destination. It is also 
sometimes referred to as the all unicast setting, since here all possible non-trivial unicast flows are simulta- 
neously active (the unicast flows between source destination pairs connected by infinite capacity antidote 
links are ignored because they trivially have infinite rate). Also note that as always it is possible, without 
loss of generality, to represent this index coding problem with only one message per source and only one 
message per destination by increasing the number of sources and destinations, but we prefer the compact 
representation shown in Figure [6] which directly reflects the X channel setting. X networks often lead to 
interesting interference alignment problems. Indeed, that is the case with this example as well, where one- 
to-one alignment does not suffice and the optimal index coding scheme is a scalar linear subspace alignment 
scheme. Next we proceed to describe the alignment solution. 

S 6 

+ Xio + %13 + X 14 + Xis 
+ Xio + Xu + Xu + #15 
+ Xio + Xu + X12 + Xi 5 
+ X 8 - Xu) - Xis 
+ Xg + X$ — Xio ~ Xu — Xi5 
+ X 6 + X$ + X14 

(a) (b) 

Figure 6: Index coding problem corresponding to a CBIA X channel setting, (a) Desired message and antidote sets, 
(b) Capacity optimal solution shows the 6 dimensional S 6 space, along which 15 symbols are sent. The field F can be 
chosen arbitrarily. 

There are M = 15 distinct messages and K = 5 destinations. Each black square on the left is a base 
station (source) sending 3 distinct messages. Base station (source) i, 1 < i < 4 sends messages W3i+i:3i+3 
and the 5-th base station sends Wi,s. We have 
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Our goal is to achieve the symmetric rate of | per message, which is also the capacity of this network 
(the outer bound follows from a subsequent solution of a broader class of X networks, presented in Sec- 
tion |433). The achievable scheme is a scalar linear achievable scheme over 6 channel uses and hence the 
precoding vectors Vj, 1 < i < 15, are 6x1 vectors. Therefore, the transmitted symbol sequence S 6 is 

15 

S 6 = J2Vi x i ( 10 ) 

i=l 




where the scalar symbol x< represents message Wi. To see why interference alignment is necessary note that 
each destination has access to the 6-dimensional symbol S 6 , and it also observes 6 messages as cognitive 
information. After removing the known symbols from S 6 the receiver is left with 9 remaining unknown 
symbols in a 6-dimensional space. Since each destination is interested in 3 desired symbols, they must 
occupy a 3-dimensional space, leaving only 3 dimensions within which the remaining 6 undesired symbols 
that constitute interference, must align. 

Suppose Ti , T 2 , T 3 , T 4 , T 5 , T 6 are 6 linearly independent vectors over 6-dimensional space. These 6 vec- 
tors can be chosen to be the columns of 6 x 6 identity matrix. Since W3 5.7 are desired at destination 1, they 
should span a 3-dimensional space and therefore are sent over 3 linearly independent vectors Ti,T2,T3. 
On the other hand, W24 are the interfering messages at destination 1 and both are desired at destination 5. 
So these two messages should be sent over vectors that are linearly independent of each other and linearly 
independent from T 1 ,T 2 ,T 3 . Therefore, we send W 2l W± over T 4 ,T 5 , respectively. Also W\ is desired at 
destination 4 and is considered as an interfering message at both destination 1 and destination 5. So W\ 
should be sent over a vector that is linearly independent from Ti , . . . , T5 and is chosen to be sent along Tg. 

Interfering messages WijWa, W4 occupy a 3-dimensional space at destination 1. The remaining inter- 
fering messages at destination 1, i.e., We,8,9/ should be sent along precoding vectors chosen such that they 
remain in the same span as span(T 4 ,T 5 ,T 6 ), i.e., 



span(V 6 ,V 8 , V 9 ) € span(T 4 ,T 5 ,T 6 ) 



(11) 



One way to satisfy | [TT| | is by choosing = Tq, Vg = T5. The remaining precoding vector is designed later 
to satisfy the following 



At destination 1 : span(V 8 ) € span(T 4 ,T 5 ,T 6 ) 



(12) 



Similarly, to satisfy the requirement of aligning interfering messages in 3-dimensional space at different 
destinations, we design the precoding vectors as follows: 



At destination 2 
At destination 3 
At destination 4 
At destination 5 



leading to the formulation 



span(Vn) e span(T 2 ,T 3 ,T 5 ) 
span(Vi 4 ) e span(V 8 ,T 3 ,Vi ) 
span(V 2 ) = span(T 4 ) € span(Vu, Vi ,Ti) 
span(V 5 ) = span(T 2 ) e span(V 14 ,T 1; T 6 ) 
V 12 =T 3 , V 15 = V 10j Vi3=Ti 



V 8 = aiT 4 + a 2 T 5 + a 3 T 6 
Vn = b{Y 2 + b 2 T 3 + 6 3 T 5 
V10 = C1V14 + c 2 T 3 + c 3 V 8 
V10 = diVn + d 2 J 1 + rf 3 T 4 
V14 = e x Ti + e 2 T 2 + e 3 T 6 



(13) 
(14) 
(15) 
(16) 
(17) 



(18) 
(19) 
(20) 
(21) 
(22) 



where 01, b\, . . . , d 3 , e 3 are linear combination coefficients. In order to satisfy 1 18 1-|22 \, the only thing that 
restricts us from choosing the linear combination coefficients to be random is that V 10 should satisfy both 
( p0| and ||2Tj. If we substitute from l[l8j,l[l9} and p2) into l(20) and |(2lJ, Vi should satisfy the following two 
equations 



V10 = Cie 1 T 1 + Cie 2 T 2 + c 2 T 3 + c 3 aiT 4 + c 3 a 2 T 5 + (c 3 a 3 + Cie 3 )T 6 
V10 = d 2 Ti + C/161T2 + dib 2 T 3 + d 3 T 4 + dib 3 T 5 



(23) 
(24) 



To satisfy |23| and 1 24 ), we have the following 



d- 2 = c\e\ 
d\bi = ae 2 
dib 2 = c 2 
d% = c 3 ai 
dib 3 = c 3 a 2 
C3»3 + cie 3 : 







(25) 
(26) 
(27) 
(28) 
(29) 
(30) 



Clearly, there are many solutions. One of the solutions for these system of nonlinear polynomial equations 

is a± = a 2 = a 3 = b\ = 62 = c\ = c 2 = d\ = d 2 = &i = e 2 = e 3 = 1, 63 = c 3 = d 3 = — 1, according to which, the 
precoding vectors are chosen as follows 



V 8 
Vu 
V 14 
V10 



T 4 - 
T 2 - 
Ti- 
Ti- 



T 5 - 
T 3 
T 2 
T 2 



T 6 
T 5 
T 6 
T 3 



Note that this is not a one-to-one alignment solution, e.g., V 8 does not align with T 4 ,T 5 ,T 6 individually. 
In fact it is pairwise linearly independent of all three. V 8 aligns only within the subspace spanned by 
T 4 , T5, Tq. This is referred to as subspace alignment. 

After satisfying all the alignment constraints, we need to prove the resolvability of desired messages at 
the corresponding destinations. They are resolvable because at each destination i the desired messages are 
received in a space called T>i which is linearly independent from the space of interfering messages, li. 
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Finally, the receiver combining matrices are chosen as follows, so that both property 1 and property 2 are 
satisfied 

At destination 1: U 3) i = [1 0], U 5) i = [0 1 0], U 7 ,i = [00 1 0] 

At destination 2: U 6 . 2 = [-10 - 1 1], U 8 , 2 = [1 00 1 00], Ui , 2 = [1 00] 

At destination 3: U 9 , 3 = [01001 - 1], Un, 3 = [01010 - 1], U 13)3 = [10010 - 1] 

At destination 4: Ui 2 , 4 = [001010], U w , 4 = [010010], Ui, 4 = [0-100-11] 

At destination 5: U 15j5 = [0 1 0] , U 2 , 5 = [0 1 1 0] , U 4:5 = [001010] 

Note that the field can be chosen arbitrarily, i.e., any choice of F works for the linear solution presented 
above. The solution also translates to the real or complex fields, thereby establishing the DoF for the corre- 
sponding CBIA problem as 1/6 per message. 

4 Background and Results 

In this section we present the results of this work along with the relevant background comprised of related 
prior work. Our first result is the insufficiency of linear codes for the multiple unicast index coding problem. 

4.1 Insufficiency of Linear Codes for Multiple Unicast Index Coding 

The index coding problem was introduced in the multiple unicast setting by Birk and Kol in [2] and sub- 
sequently studied by Bar-Yossef et al. in Ifl9ll , where it was shown that the minimum number of channel 
uses required to send 1 bit per message with the optimal binary scalar linear index coding scheme is equal 
to the minrank function. Further, as noted in [20], this result can be generalized by allowing the coding to 
be performed over an arbitrary finite field. Algorithms for finding or approximating the minrank function 
are proposed in Ell and 11221 . Haviv and Langberg in 1 23 J investigated the min-ranks of random digraphs. 
Within the class of scalar index coding schemes, the insufficiency of linear codes was established by Lubet- 
zky and Stav in lf24l . Alon et al. further established the sub-optimality of scalar linear solutions in |18| by 
highlighting the benefits of vector linear coding. Since vector linear coding is a much more powerful form 
of linear coding than scalar linear coding, its optimality relative to non-linear coding schemes has been 
a topic of great interest, and has produced a series of results in general contexts that have progressively 
closed in on the index coding problem. The series of breakthroughs started with the work of Dougherty 
et al. in |25| who used the connections between the network coding problem and representability of ma- 
troids to establish a gap between vector linear coding and non-linear coding for the multiple unicast network 
coding problem. However, the gap result of Dougherty et al. did not apply to the index coding problem 
which is a special case of the network coding problem. Building upon the work of |25J, Rouayheb et al. 
established connections between the representability of matroids and the multiple groupcast index coding 
problem. Most recently, Blasiak et al. further developed this relationship in |26], and used the construction 
of lexicographic products of graphs to show that for the multiple groupcast index coding problem, vec- 
tor linear codes are strictly out-performed by non-linear codes. However, the gap result for the multiple 
groupcast index coding problem does not apply to the multiple unicast index coding problem. Hence, in the 
original setting of multiple unicast index coding, which is also the most commonly studied form of index 
coding, the optimality of linear codes remains open. 

Our first result, presented in Theorem [Tl settles this issue. 



Theorem 1 Linear coding is insufficient to achieve the capacity region of the multiple unicast index coding problem. 

The proof of Theorem [T] is presented in Section [5] The key to this result is the construction of an equiva- 
lent multiple unicast problem for an arbitrary groupcast setting. Loosely speaking, this equivalent multiple 
unicast setting has the following two properties: 

• Any rate that is achievable with linear coding in the groupcast setting is also achievable with linear 
coding in the multiple-unicast setting and vice-versa. 

• Any rate achievable in the groupcast setting (using any achievable scheme including possibly non- 
linear strategies) is achievable in the multiple unicast setting. 

Since [26 1 has a construction of a groupcast index coding problem where linear coding schemes are insuffi- 
cient, the above conditions imply that linear coding is insufficient in the equivalent multiple unicast setting 
as well. 

The construction of the equivalent multiple unicast index coding problem is described next. The basic 
idea of the construction is to replace each message that is desired by multiple destinations, with a new 
set of independent messages, one for each original desired destination. Thus, each destination desires 
only one of these independent messages (and has the rest as antidotes), giving us a unicast setting. The 
equivalence to the original groupcast setting is enforced through a requirement that these independent 
messages are to align into the same space as the original message that they replace. This is accomplished 
by introducing auxiliary messages and destinations (that contain the subscript in the description below), 
one for each original message. Each auxiliary destination sees only one set of messages as interference (has 
all other messages as antidotes) and its desired auxiliary message leaves only as many signal dimensions 
for interference as each of the members of the set, thus forcing them into alignment. It is this alignment that 
allows the optimal solution in the multiple groupcast setting to be used in the multiple unicast setting and 
vice-versa. 

Construction 1 Consider an arbitrary groupcast index coding problem where there are M messages and K destina- 
tion nodes. Without loss of generality^ we assume that 

• each message is desired by L destinations so that K = LM, and 

• each destination desires exactly one message. 

In this groupcast index coding problem, the set of messages, the set of destinations, and the set of antidotes are 
respectively denoted by W, V, {Ak ■ k = 1, 2, . . . , K}, where 

D k = ({W m },A k ),m=\k/L-\ 

We now construct an equivalent multiple unicast index coding problem as follows. The multiple unicast setting 
has LM + M messages and K + M destinations. In this multiple unicast index coding problem, denote the messages 
as 

W = {^1,0,^1,1,^1,2,. • .,^1,^,^2,0,^2,1, . . .,W 2 ,L,- ■ .,WMfi,W M ,l,WM t 2, ■ ■ -,W M ,l}- 

The destinations in the multiple unicast system are denoted as 

Di,j ■ i = 1,2, ...,M,j = 0,1,2,..., L 

and the antidotes are denoted as 

A id ,i=l,2,...,M,j = 0,l,2,...,L 

2 There is no loss of generality in the first assumption because if there is a message desired by L' < L destinations, then we can 
add L — L' virtual destinations with antidote sets identical to any of the V (original) destinations, and which desire the appropriate 
subset of the messages desired by the original destinations. Similarly, there is no loss of generality in the second assumption because 
if a destination desires multiple messages, it can be equivalently replaced by multiple copies of itself, each interested in only one of 
the originally desired messages. 
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(b) 



Figure 7: (a) Groupcast index coding where M — 2, K — 3 and each message is desired by two destinations (b) 
Equivalent multiple unicast index coding problem 



so that destination D it j — (Wij,Aij). The set of antidotes is 



A% 



where 



A(i-i)L +j ,. U W.,o U {W t j : I ? j}, j^O 
W-W,, j = 



A k ,. = {W m ,i :W m eA k ,l = 0,1,2,..., L} 

W.,o = {W lt0) W 2 , 0) ~- ,W M .o} 
W k ,. = {W k , ,W k , u --- ,W k , L } 



(31) 



(32) 
(33) 
(34) 



As an example, a groupcast index coding problem and its equivalent multiple unicast index coding 
problem are shown in Figure [7] 

Theorem 2 For every groupcast index coding problem, the equivalent multiple unicast index coding problem speci- 
fied by Construction^ satisfies the following properties: 



Property 1: Rate tuple 1Z = {R\,R%, . . . Rm) is achievable in the groupcast setting only if the rate tuple 

R _/ Ri, jVO 
* J ~ 1 i-Ri, .1 = 

is achievable in the equivalent multiple unicast problem. 



(35) 



Property 2: Rate tuple TZ = (i?i, i?2, ■ • • Rm) is achievable via linear coding in the groupcast setting if and 
only if the rate tuple 

Rij = \ i (36) 



1-Ri, j = 

is achievable via linear coding in the multiple unicast problem. 



The proof of Theorem [2] is presented in Section |5j where we also argue that it implies Theorem [T] Here, we 
summarize the intuition behind the proof. In the groupcast setting, a message W m is desired to be decoded 
by L destinations, £>( m -i)z,+i>-D(m-i)L+2> • ■ ■ , D m L. The equivalent multiple unicast construction described 
formally above can be intuitively viewed as one obtained by expanding the message W m into L indepen- 
dent messages W my i,W m ,2, W mt L — each of these L messages desired uniquely by one destination 
as required for a multiple unicast index coding problem. Because of this construction, there is a corre- 
spondance between destination D( m _ 1 ^ L+ j in the groupcast setting, and destination D m j in the equivalent 
multiple unicast setting. This correspondance is maintained in the antidote structure which is carried over 
from the groupcast setting to the multiple unicast setting. In particular, if a message Wi is present as an an- 
tidote at destination D( m _x)£ +J - in the groupcast setting, then, destination D m j has W^i, Wj,2) • • • , W^l as 
antidotes in the multiple unicast setting (and vice-versa). In addition, destination D m i has as antidotes, all 
the messages W m ^,1 ^ I. This antidote structure allows all messages in the set {W mt x, W m p, . . . , W m ^} 
to occupy the same "space" in a linear coding scheme, since a destination that desires any one of these 
messages, has all the other messages in this set as antidotes. In addition, because the antidote structure 
from the groupcast setting is carried over to the multiple unicast setting, for any achievable scheme in the 
groupcast setting, the messages W mi i,W m> 2, ■ ■ ■ , W m ,L can occupy (i.e. align in) the same "space" occupied 
by W m in the groupcast setting. This means that any achievable scheme in the groupcast setting naturally 
translates to the multiple unicast setting. For the converse establishing equivalence (in Property 2), we also 
need to show that a scheme achieving rate R m for messages W m ,i, ■■•> W„ ly L in the multiple unicast setting 
can be translated to a scheme achieving rate R m for message W m in the groupcast setting. To ensure this, 
we use the auxilliary destination D m o in the multiple unicast construction. The message W m ,o desired by 
this auxiliary destination is provided as an antidote to all the other destinations and therefore does not af- 
fect achievability of rate R m for any other destination. Destination D m0 has as antidotes, all the messages 
except W m ,o-,W mt i, . . . , W m ,L- This means that the space occupied by W m .o has to be linearly independent 
of W m> i, W m ,2, ■ ■ ■ W mi L- Now, if this message W m ,o has a rate of 1 — R m , then the interfering messages 
faced by this destination - W m> \, W m ^ , W m ,L - have to together occupy a space of dimension nR m . This 
implies that if each of these interfering messages have to achieve a rate R m , they have to align (nearly) per- 
fectly. This aligned space can be used in the groupcast setting to encode W m at rate R m enabling translation 
of achievable scheme from the multiple unicast setting to the groupcast setting. Equivalence for non-linear 
schemes is based on random coding arguments and superposition coding for auxiliary messages, according 
to the detailed proof presented in Section [5] 

4.2 Feasibility of Symmetric Rate when V/c, > L 

Given that Vfc, | Wfc | > L, without loss of generality we can assume that \Wk \ = L, V k € /C, i.e., each destina- 
tion is interested in decoding exactly L distinct messages, i.e., Wk = {Wk ± , Wk 2 , ■ ■ ■ , Wk L }, Vfc; € M. This is 
because destinations that wish to decode more than L messages can be split into multiple destinations with 
the same set of antidotes, that each wish to decode a subset of size L of the original messages, such that the 
union of these subsets is the original set of desired messages. 

If M — L or M = L + 1, it is easy to achieve rate per message by sending each message separately at 
each time and each destination can achieve rate or jxj, respectively, which are both greater than or equal 
to and hence rate per message is feasible. If M > L + 1, intuitively, since all desired messages must 
pass through the bottleneck link of capacity 1 and all messages must simultaneously achieve rate each, 
then some overlap of signal dimensions within the bottleneck symbol S n is unavoidable. The interfering 
messages that are available through antidote links can be subtracted. The desired signals consume a fraction 
of the capacity of the bottleneck link, which must be free from interference. This leaves only the 
remaining of the signal space for interference within which all interfering messages, each of which 
carries rate jxjv should overlap nearly perfectly. The intuitive explanation is formalized for all possible 
coding schemes through a Shannon theoretic framework. 

The following terminology is introduced specifically for the setting where each message wants to achieve 
rate j^. 



k k 

• Alignment Relation: We define a relation W t o W 3 as follows. W t -h- Wj iff W, ^ Ah, Wi f. Wk, 
Wj Ak and Wj Wk for k e K and distinct indices i, j G M. In the -^px rate feasibility problem, 

k l 

the relation Wi «-» Wj represents the understanding that Wi and Wj must align (into of the signal 

space within the bottleneck link in order to leave the remaining of the signal space for Wk)- We 
may occasionally use the notation Wi <-» Wj when the identity of the destination is not important. 

• Alignment Subsets: The set of messages W is partitioned into alignment subsets, created as follows. 
If Wi O Wj , then both Wj, Wj belong to the same alignment subset. Further, if Wi O Wj and Wj <H- 
W m then Wi, Wj, W m all belong to the same alignment subset. For the jxj rate feasibility problem, 
we expect that the messages within an alignment subset will need to align almost perfectly within the 
bottleneck link signal space. 

As an example, consider an index coding problem shown in Fig. [8j where Wi = { W\ , W 2 } , W2 — { Wi , W 3 } , W3 = 

{W 2 , Wa},A\ — 0,A2 = {Wi},A3 — {W 3 }. In this example, the alignment relations are W 3 «-» W 4 and the 
alignment subsets are {Wi}, {W2}, {W3, W4} and the rate | per message is feasible. 




Figure 8: Groupcast index coding problem with M = 4, K = 3 and L = 2 where the rate | per message is 
feasible 

Based on this definition for the alignment subset, we have the following theorem for the j-W rate feasi- 
bility. 

Theorem 3 The rate tuple TZ with Ri = R 2 = ... = Rm = j^i is not achievable in the multiple groupcast index 
coding problem where \ Wk \ = L, V k £ K, if and only if there exist distinct indices i,j e M such that Wi, Wj belong 
to the same alignment subset and Wj <E Wk and Wi £ Akfor k e K, 

The statement of Theorem [3] is intuitively interpreted as follows. If messages Wi, Wj belong to the 
same alignment set then they should overlap almost perfectly. Because of their overlap, it is not possible 
to recover one of them unless the other message is available as an antidote. If such an antidote is not 
available then the achievability of rate for every message becomes infeasible. Note that the feasibility 
condition refers to all possible coding schemes and not just linear coding schemes. The proof of Theorem|3] 
is presented in Section [6] 

As a special case of Theorem |3j for L = 1, we recover the feasibility condition for achievability of rate 
half per message, previously obtained by Blasiak et al. in l27l . 

Corollary 1 The rate tuple TZ with R 1 = R 2 = ... = R M = i is not achievable in the multiple groupcast index coding 
problem where each destination desires only one message, if and only if there exist distinct indices i,j G K. such that 
Wi, Wj belong to the same alignment subset and {Wj} — Wk and Wi Akfor k G /C 



The rate-half feasibility condition of Corollary [T]has previously been presented by Blasiak et al. in [27] 
using graph theoretic terminology which involves notions such as almost-alternating cycles and graph- 
compatible functions. While the two results are essentially identical, viewing the problem through the 
lens of interference alignment allows a much more intuitively transparent statement in terms of alignment 
subsets. 

While the feasibility conditions are not limited to linear schemes, remarkably, linear coding is sufficient 
for achievability for feasible settings. The following theorem further elaborates on the linear coding scheme 
and the required field size. 

Theorem 4 Whenever rate half is feasible according to Corollary^ it can be achieved through: 

1. scalar linear coding over 2 channel uses if the finite field is large enough. Specifically, |F| > Z. 

2. vector linear coding over a given finite field F if the number of channel uses n is large enough. Specifically, 
n/2> log| F | Z and n is even. 

where Z is the number of alignment subsets. 



Proof. From the detailed proof of achievability of Theorem [3] presented in Section 6.1 it is evident that 



what is needed for half-rate achievability is simply a one-to-one mapping, from each alignment subset, 
to an ri/2-dimensional sub-space of an n-dimensional vector space, such that the subspaces assigned to 
any two alignment subsets are non-intersecting. The number of pairwise non-intersecting A/ t -dimensional 
subspaces of an m-dimensional vector space over a field F is shown by (281 to be ff" t ~\ whenever M t 
divides to. For rate-half achievability, we have m — n, M t — n/2, and the result of Theorem|4]follows. 

Note that while it clearly takes only polynomial complexity to identify the alignment subsets (also 
pointed out by Blasiak et al. |27|), finding the minimum possible number of alignment subsets is much 
more challenging. This is because it is generally possible to further consolidate alignment subsets as long 
as alignment constraints are not violated. In other words, two alignment subsets may be combined if there 
does not exist a message in either subset that cannot be aligned with a message in the other subset. Recall 
that two messages cannot be aligned if one of them is desired at any destination that does not have the 
other message as an antidote. Finding the minimum number of alignment subsets for a feasible problem is, 
however, NP-hard. As a consequence, deciding the feasibility of e.g., rate half, when both the field F and 
the number of channel uses n is fixed, is NP -complete. This is noted specifically in the context of rate-half 
feasibility over a binary field and 2 channel uses, by Dau et al. in [29 1. On the other hand, as noted above, 
if either the field size or the number of channel uses can be chosen to be large enough, then there is no 
need to consolidate the number of alignment sets, and determining the feasibility of rate half involves only 
polynomial complexity. 



4.3 Symmetric Instances of the Multiple Unicast Index Coding Problem 

As with most multiuser capacity problems, part of the difficulty of the index coding problem lies in the 
potentially unlimited number of parameters in the number of users and the desired and antidote message 
sets for each user, which makes a systematic study difficult. In order to limit the number of parameters 
while still covering broad classes of index coding problems, in this section we study symmetric instances 
of the multiple unicast index coding problem, e.g., where relative to its own position, each destination is 
associated with the same set of desired and antidote messages. 

Solutions to the multiple unicast index coding problem have been found for a variety of symmetric 
settings. Since here M. = K. and destination k desires only message Wk, the problem can be represented 
by a directed graph G on the vertex set M., in which a vertex i is connected to a vertex j if and only if the 
destination Di knows Wj. Bar-Yossef et al. in [20 j found the optimal symmetric rate for directed acyclic 
graphs, perfect graphs, odd holes (undirected odd-length cycles of length at least 5) and odd anti-holes 
(complements of odd holes). Also, Blasiak et al. in [27] found the capacity per message of the following 
symmetric index coding instances (the parameters will become clear later on in this section). 



• Neighboring antidotes where D = U, showing that the sum capacity is per message. 

• Ak — {Wk+i,W k+K / 2 } for even K, showing that the sum capacity is j| per message. 

• Neighboring interference for arbitrary K where U = D = 1 and showing that the sum capacity is 

IK/21 

1 L J per message. 

Berliner and Langberg in |30] characterized the solution of index coding problems with outerplanar 
side information graphs in terms of the clique cover size of the information graph where the encoding 
functions are (scalar) linear. Ong et.al in [31 1 defined uniprior index coding problems as the case where 
Ai fl Aj = for i 7^ j and single uniprior as the case where | Ai |= 1, Vi G fC. They derived the optimal 
symmetric rate for all single uniprior index coding problems. 

In the following subsections, we present our capacity results for various symmetric classes of the index 
coding problem, mostly inspired by the natural settings for the CBIA problem. 



4.3.1 Neighboring antidotes 

Consider a symmetric multiple unicast index coding problem where each destination has a total of U + D = 
A < K antidotes, corresponding to the U messages before ("up" from) and D messages after ("down" from) 
its desired message. For this setting, we state the index coding capacity in the following theorem. 

Theorem 5 The capacity of the index coding problem with M = K < oo, (all subscripts modulo K) 

D k = {{W k },{W k -u,W k - U+ i,--- ,Wfc_i}U{WjH-i,Wfc +2) --- ,W k+D }) (37) 

and 

U,DeZ (38) 
< U < D (39) 
U + D = A < K (40) 

is 

r _r i, a=k-i 

I K~A+2U> A - A Z 

per message. 

As an example, consider the K = 5 user setting with A = 2 antidotes, U = D = 1, where the capacity 



is 2/5 per message. Incidentally, example 2 in Section 3.2 also has K = 5 users, A = 2 antidotes, and 
has the same capacity of 2/5 per message. In general, however, for a fixed number of users, K, and a 
fixed number of total antidotes, A, the capacity per message depends on the relative position of antidotes. 
Notably, for fixed K, A values and neighboring antidotes as considered in Theorem|5j the capacity improves 
as the number of antidotes on either side becomes more evenly distributed, i.e., as D — U becomes smaller. 
The best case setting, i.e., the setting with the highest capacity, is when the antidotes are symmetrically 
distributed on both sides, e.g., A = 2U = 2D, (for even ^4) which leads to a capacity of 4^ per message. 
The worst case setting is when the antidotes are all on the same side, i.e., A = D, U = 0, which leads to a 
capacity of K \ A per message. No interference alignment is needed in the latter case. 

The achievability scheme in general is a vector linear coding scheme with one-to-one alignments, where 
each message is sent through U+1 scalar symbols over a U + 1 dimensional signal space. Adjacent mes- 
sages overlap in U dimensions. Because of this interference alignment, at any receiver, the total number of 
signal dimensions occupied by the K — A — 1 interfering messages is equal toll + (K — A — 1). The U + 1 di- 
mensional desired signal space is chosen to not have an intersection with the interference space, so that the 
dimension of the total space, i.e., the number of channel uses n equals U + (K — A — 1) + U + 1 = K — A + 2U 
and the capacity (normalized by the number of channel uses) is (U + 1)/(K — A + 2U) per message. The 
details of the achievability proof and the converse are presented in Section [7| 



4.3.2 Neighboring interference 



Consider the following CBIA setting. We have a locally connected network with M = K = oo where each 
receiver k has only one desired message W k from its corresponding (base station) transmitter k. There are 
totally U + D + 1 transmitters with non-zero channel coefficients to receiver k. One of them is the desired 
transmitter k, U of them are the transmitters before (up from) and D of them are the transmitters after (down 
from) the desired transmitter, as illustrated in Fig. [9] Basically what is sent over the U + D neighboring 
links constitutes interference for receiver k. The index coding problem for this locally connected network 
has the antidote graph that is the complement of the connectivity graph shown in Fig. [9] 



Transmitters that arc 
heard by Receiver k 




\ W k+D 
W k+D+1 



Figure 9: CBIA setting: Locally connected network representation for multiple unicast with neighboring interference. 
The corresponding index coding problem has the complementary antidote graph, i.e., there is an antidote link for each 
transmitter receiver pair that are not connected above, and there is no antidote link for each transmitter receiver pair 
that are connected in the picture shown above. 

For such a network, we have the following: 
Theorem 6 The capacity of the index coding problem associated with Fig. [9] where M = K = oo, 

Dk = ({W k },{ W k-U,k-U+l,-- ,k-l : k,k + l.k+2,-- .k+D}) (4 2 ) 

and 



U,DeZ 
< U < D 



(43) 
(44) 
(45) 



is 



per message. 



C 



1 

D + l 



(46) 



Evidently, if each destination has totally L = U + D missing antidotes corresponding to the U messages 
before (up from) and D messages after (down from) its desired message, the best case setting, i.e., the 
setting with the highest capacity, is again when the missing antidotes are symmetrically distributed on 
both sides, i.e., U = \_^\ and D = [~~] , which leads to a capacity of l/( +1) per message. The worst case 
setting is when the missing antidotes are all on the same side, i.e., D = L, U = 0, which leads to a capacity 
of per message. No interference alignment is needed in the latter case. 




Figure 10: CBIA setting: X network setting with local connectivity representation. The corresponding index coding 
problem has the complementary antidote graph, i.e., there is an antidote link for each transmitter receiver pair that are 
not connected above, and there is no antidote link for each transmitter receiver pair that are connected in the picture 
shown above. 



The achievability scheme in general is a scalar linear coding scheme with one-to-one alignments, where 
each message is sent through one scalar symbol over a D + 1 dimensional signal space. U messages before 
(up from) each desired message are aligned with the last U messages among D messages after (down from) 
that desired message, respectively. Because of this interference alignment, at any receiver, the total number 
of signal dimensions occupied by the D + U interfering messages is equal to D. The one dimensional 
desired signal is chosen to not have intersection with the interfering signal space and over D + 1 channel 
uses, it is resolvable. The details of the achievability proof and the converse are presented in Section [8] 

4.3.3 X network setting with local connectivity 

Consider the following CBIA setting. We have a locally connected network where each destination is con- 
nected to L consecutive base stations. Suppose it is the X network setting where each base station has a 
distinct message for each connected destination. Without loss of generality, we can rename the destinations 



such that destination i is connected to sources i,i + 1, . . . ,i + L — las depicted in Figure 10 The index 



coding problem for this locally connected network has the antidote graph that is the complement of the 



connectivity graph shown in Fig. 10 For such a network, we have the following: 



Theorem 7 The capacity of the symmetric index coding problem with M = KL and M, K — > oo, where 

Wfc = {WkL,kL+L-l,(k+l)L+L-2,--- ,(k+i)L+L-i-l,— : (fc+L-2)L+l} 5 

Ak = {W(k-l)L+\;{k-\)L+L-l,— ,(k+i)L+l:(k+i)L+L-i-2,(k+i)L+L-i:(k+i)L+L,— ,(k+L-2)L+2:(k+L-2)L+L U Wfc} C 



IS 



per message. 



C = — r (47) 

L(L + 1) K ' 



The achievability scheme in general is a scalar linear coding scheme with one-to-one alignments, where 
each message is sent through one scalar symbol over a L ^ L 2 +1 - ) dimensional signal space. L 2 — L interfering 



messages are aligned into £ ^ L 2 +1 ' 1 — L dimensions using the pattern shown in 1 190 where this pattern is 



repeated for each group of L + 1 consecutive transmitters for the shown CBIA setting in Figure 10 The 



L desired symbols are chosen to not have intersection with the interfering signal space and over ^ +1 - > 
channel uses, they are resolvable. The details of the achievability proof and the converse are presented in 
Section |9] 



5 Proof of Theorem [Tj Insufficiency of Linear Codes for Multiple Uni- 
cast Index Coding 

Notation: We use the overline notation to describe quantities in the equivalent multiple unicast setting. For 
example, for a given achievable scheme over the multiple unicast setting S, n, 1Z represent the alphabet, the 
number of channel uses, and the rate vector respectively. The codeword and the encoding function for this 
achievable scheme are denoted as S" and f{Wi >0 , Wi,i, . . . , Wm,l) respectively. 



5.1 Proof of Theorem [2]- Property 1 

The rate (R\, i?2, • ■ ■ , Rm) is achievable in the groupcast setting. This means that, for every given S > there 
exists n and an encoding function f(Wx, W2, ■ ■ ■ , Wm) such that over n channel uses, destination D m can 
decode message W\ m m of rate R\ m /t\ — 5|" m /L] with a probability of error e^waj^jto for some <5[ m /L] < S. 
Without loss of generality, assume that the alphabet for the achievable scheme is S = {0, 1}. Now, we turn 
to the equivalent multiple unicast setting. In this setting, we will show that for any e > there exists 
an achievable encoding scheme (over the same alphabet S = S = {0, 1}) with rates as in 1 35 1 such that 
the probability of error for each message is smaller than e. The encoding scheme for achievability can be 
described in the following three steps. 

Step 1: Re-use / to encode Wij, j 7^ 0, i = 1, 2, . . . , M, and 



Step 2 
Step 3 



form a random codebook for messages Wi t o, i — 1,2, ... , M, and then, 
superpose the M + 1 codewords formed in Step 1 and Step 2. 



We will show that this approach achieves the desired rates in the multiple unicast setting. For the achievable 
scheme, we set the block length to be n = nriQ where jiq is chosen to be sufficiently large for purposes that 
will be described. For this blocklength, we have |Wjj | = 2 nn °( Ri ~ Si \ For Step 1 above, we first represent 
Wij as nrio(Ri — Si) bits, and then split it into tiq blocks of length n(Ri — Si). Thus, we have 



w i , j = (wf],wfl,.. 

where W t j is a bit vector of length n(Ri — Si). Now, we set 



(48) 



where the sum above is a bit-wise XOR of all the messages. Note that W \ is a bit-vector of length n(Ri — 
Si). Now, we are ready to form the codeword of Step 1, denoted as S*™" e {0, l} Tm °. 



(49) 



-~nn 
^1 — 






A 


\ Si'' 1 




. f(w [ r ] M°\ 


:,Wt 0] ). 







3 This is because, as noted in Sectionplthe e-error capacity is the same as the zero-error capacity in the index coding problem. 



where S™ is an n bit column vector. 

Now, we proceed to Step 2. Message Wk.o, is encoded in {0, l}' 1 " by treating the codeword as no block- 
symbols, each of size n, and using i.i.d. random coding for each block. In particular, we generate a random 
codebook of length no over alphabet {0, 1}", where each of the no entries are chosen uniformly over all 
possible n-bit vectors, and independently of each other. We denote by S™ , the codeword corresponding 
to Wifi, and set 

M 

S =Z^ S ifi ) ( 50 ) 
t=l 

where So,S ifi £ {0,1}. 

We are now ready for Step 3. The codeword sent on the multiple unicast channel is set as 

„nno _ ^nno ~^nn 
O - il + Oq , 



where 5™"° and S^" are chosen from 1 49 i and 1 50 >, and the addition is bit-wise XOR of the corresponding 



entries of the vector. 

Now we need to show that each message can be decoded with a probability of error smaller than e. 
First, consider message Wjj,j ^ to be decoded at destination Dij. Note that this destination has, as 



antidotes, W.,o because of (31 1. Therefore, it can compute S m ° and subtract it from the received codeword 
to obtain f(W™^ , W^, . . . , wff) f° r m = 1,2, . . . ,no. For all I such that Wi € A^-i)L+j, this destination 
has, as antidote, {Wij : j = 0, 1, 2, . . . ,£} and can therefore compute Wi- Now, destination D it j resembles 
destination Du_i\ L+ j in the groupcast setting. In particular, the decoding strategy used by D^_^ L+ j in 

the groupcast setting can be used by Dij in the equivalent multiple unicast setting to decode w[ m ', free 
of error. Also, note that the destination has {Wi i : / ^ j} as antidotes. Therefore, it can decode W"j from 

Now, consider destination D^o- This destination has W — W< as antidotes. Therefore, it can compute 
Sk"o > ^ 7^ * and cancel the effect of this to obtain S 1 ™" + S i0 ° . Intuitively, the signal obtained after this 
cancellation can be interpreted as an additive noise channel over alphabet {0, 1}™, where the noise vector 

faced by the mth channel use is s\™ , m = 1, 2, . . . , n . Using the random coding argument, for sufficiently 
large n , Wiq can be decoded by this destination with probability of error smaller than e, for any e > for 
any rate Ri.o up to the mutual information 



is: 



Si,o° iAi,o; Wifi 



nn 
1 



Tin \ 



nn 
1 - 

1 - (Ri - Si) 



<Si,o°IA,o) 



— H(ST° 
nn n 



(51) 



■S??\Ai, a ,Wi, ) (52) 



J-H(S" m \A,o,W ifi ) 
nn n 



(53) 
(54) 



where, 1 53 1 comes from the fact that 5 i is uniformly distributed over {0, 1}"™° through the random coding 
construction (and is independent of Aifi). The final bound comes from the fact that given (Aifi, Wifi), S™ n ° 
is an invertible function of Wi and H(Wt) — nn a (Ri — Si). 



5.2 Proof of Theorem [2]- Property 2 

Here, we first consider the case where rate (Rx, R2, ■ ■ ■ , Rk ) is achievable in the groupcast setting via a linear 
coding based achievable scheme. This means that for any 8 > 0, there exists n.,U;, 2 = 1,2, ... , ML, k = 
1,2, ...,M, such that 



U,V fc =0, VI = 1,2,. ..,ML,W k $Ai,k?\l/L] 



(55) 



det(U,V ri/il )^0, Vie {1,2,..., ML}, (56) 

where is a n x rt(i?fc — <5fc) dimensonal matrix, and U/ is a n(-Rr;/M — ) x rt matrix, where 5^ < 6. 
The encoding scheme for this setting is 

M 
k=l 

where S e F and is a n(Rk — 8%) x 1 vector. Now, we turn to the equivalent multiple unicast setting, 
where we provide linear encoding achievable scheme for rates 



The above automatically implies achiev ability of rate | |36| . The encoding scheme for the multiple unicast 
setting is formed by setting Vjj = Vj for j ^ so that 



M M 



S n = J2 V <-./ A <- X V ' (' V ' : + ^ < •• + ■ • • + -V - / ) + J2 V < ::A '< 



where 5 e F, X^- is a n(i?j - ^) x 1 vector for j ^ and n(l — x 1 vector for j = 0. The matrix V^o is 
chosen to be a n x n(l ~ B4) full rank matrix so that 

colspan(Vj) n colspan(Vj, ) = {0}. 

Since rank(Vi) < n(Ri — Si) there exists a matrix V ij0 satisfying the above condition. This condition auto- 
matically implies that there exists U;.o such that 

U;,oVi = 0, 
det(lJ 1 ,oV il o) £ 0. 

Since {Wij : j — 1, 2, . . . , L} are the only messages not present at D ii0 as antidotes, and all these messages 
are encoded using Vj, the above equations imply that the destination can decode message Wifi. Now, 
consider destination Dij,j 7^ 0. We can set U^j = U( i _ 1 ) i+:) . Then, we have for j 7^ 0, 

U(i-i)L+jV fe = 0,Vk^ A^i )L+J ,k^i 



and, because of 1 56 1, we have 

det (UijVij) = det (U (i _ 1)i+i V^) ^ 



as required. This ensures achievability of rates |36| via linear coding in the equivalent multiple unicast 
setting. 

Now, we consider the case where rates in 1 36 1 are achievable in the multiple unicast problem via linear 
coding. This means that for any 5 > there exist matrices Vj.j,Uj .j, i = 1,2, . . . ,M j = 0, 1,2, . . . , L, such 
that 

UijVfc,, = 0, Vfe, I s.t. W k j i Ai d ,(k,l)^(i,j) (58) 
del;U,.,V, , 

where V,; j is a n x — <5,; j) matrix if j : 7^ and n x n(l — Ri — <Jj,o) matrix if j = 0, for some <5jj < S,j — 
0,1,2, ... ,L,i — 1,2,..., ill. Similarly, Ujj is a n(Ri — 5ij) x rt matrix if j 7^ and it is a n(l — Ri — Si t o) x n 
matrix if j = 0. Now, note that the above implies that 

Ui, V^=0,j = l,2,„. > L. 



We now intend to show that (i2j , R 2 , . . . , Rm) is achievable in the groupcast setting via linear coding. For 
the achievable scheme on the groupcast setting, we choose 

L 

V i = <f|ool8 P an(V i)J -)). (59) 

where, for a set of column vectors V, the matrix denoted by (V) is a full rank matrix whose columns span 
V. We now intend to show that rate R i is achievable for Wi where nR { = rank(V;). To show this, we need 
to prove the following. 

(CI) We need to show that there exist nR i x n decoding matrices Uj, i = 1, 2, . . . , ML that satisfy the desired 
critiera, i.e., that 

U l V m = 0,W m $A l ,m? \l/Ll 
det(U ; V r;/L1 ) ^0. 

(C2) We need to show that the above has an appropriate rate, i.e., we need to show that R i can be made 
arbitrarily close to Ri . 

Instead of (CI) above, we show equivalently that there exist matrices such that 

u;v m = o,yw m ^A l u{w m - ] } 

rank(U;V r , /L1 ) = rank(V [;/L1 ), V i e {1, 2, . . . , ML}. 

For this we set 

U^-dl+^U^. (60) 
Let W m £ Ai,m ^ \l/L\. Because of | [3T) , we infer that 

W m ,k i •A\i/ l -\,i-l{\i/l\-x) U {W\i/ L -\,i-L{\i/L-\-i)}^k = 1,2, . . . ,L (61) 
^ *%£!,,_£([-,/£!_!) V m , fc = 0,Vfc = l,2,...,L (62) 
^U|V m = (63) 



where the final equation comes from 1 59 1 and ( |6u) . In particular, the final equation comes from noting that 
colspan(V m ) C colspan(V mj j), I ^ 0. Similarly, we have 

det(U r;/L1i; _ L(r;/L1 _i ) V r;/Ll!; _ L(ri/L1 _ 1) ) ^ (64) 
^det(u;V R/ £ 1 , ; _ i(ri/ £ 1 _ 1) ) ^ (65) 



rank^U,V r//L1 ) = rank (V fi/L1 ) , (66) 



where, the final equation follows because colspan(Vp ; / L ^ ) C colspan(Vr;/£i i-l(\i/l~\-V))- This completes 
the proof of (CI). We now need to show (C2). To show this, we use the following Lemma 

Lemma 1 Let A, B, C be three matrices respectively of sizes n x Da,ti x Db and n x Dc- Ifcolspan(A) n colspan(C) = 
{0} and colspanfJS) n colspan(C) = {0}, then, 

dim (colspan(A) n colspan(B)) > rank(A) + rank(B) + rank(C) — n. 

Proof: Let the rank of A, B and C respectively be Ra,Rb and R c . Also, let 

Raub = dim (colspan((colspan(A) U colspan(B)))) 

Rahb = dim (colspan(A) n colspan(B)) . 



where (V) is a matrix whose column-span is equal to the column-span of the set of column vectors V. We 
have 



colspan((colspan(A) U colspan(B))) n colspan(C) = {0} 

=> Raub + dim(colspan(C)) = dim (colspan ((colspan(A) Ucolspan(B) U colspan(C)))) 

Raub + Rc < n 

^R A + Rb - Rahb + Rc < n 

=> dim (colspan(A) n colspan(B)) = Rahb > RA+RB+Rc~n 

Using the fact that colspan(Vjj) n colspan(Vi.o) = {0} and the above lemma, we lower bound the rank 
of Vj = (C\f =1 colspanCV^;)) as follows. 

nR\ = rank(V s ) = dim [ f] colspan(V M ) j (67) 



\i=i 



> dim ( Q colspan(V 4 , ; )^ + rank(V ijL ) - (n - rank(V 4 , )) 



(68) 



'L-1 \ L 



> dim p| colspan(V M ) + ^ rank(V M ) - 2(n - rank(V li0 )) (69) 



J=l / l=L-l 



; (70) 

L 

> dim (colspan(V M ) n colspan(V^ 2 )) + ^]rank(V M ) - (L - 2)(n - rank(V i)0 )) (71) 



1=3 



> J2 rank(V M ) — (L— l)(n — rank(V,- )) (72) 

i=i 

L 

= n{Ri - Sij) - (L - l)(n - n(l - .Ri - M) (73) 
z=i 

>n(Ri-J2Si,i-(L-l)5i,^j 



(74) 

where, the final equation above comes from the achievability of rates p5| l on the multiple unicast setting. 
By choosing Si j to be arbitrarily small, the rate R^ can be made arbitrarily close to Ri. This completes the 
proof of (C2), and hence the proof of the Theorem [2] 

5.3 Proof of Theorem |l] 

We show Theorem [T] using the results of Theorem [2] and 11261 . (26 1 implies the following. There exists a 
groupcast setting where, there exists a rate vector [Ri,R 2 , . . . , Rk ) in its capacity region that not achiev- 
able via linear coding. Now, we consider the equivalent multiple unicast setting based on Construction [T| 
Because of Property 2 of Theorem [2] this implies that rate-tuple 



Ri, 



Ri 3^0 
1-Ri j=0 



is not achievable via linear coding in this multiple unicast setting. Further, because of Property 1 of Theorem 
[2 the achievability of (Ri, R2, • ■ • , Rk) m the groupcast setting (using a non-linear scheme) implies that the 
above rate-tuple is achievable in the multiple unicast setting. Therefore, linear coding does not suffice for 
achievability in the multiple unicast setting. 



6 Proof of Theorem 

6.1 Achievability 



Feasibility of rate per message 



We partition W = {Wi, W2, ■ ■ ■ , Wm} into alignment subsets Vi,T > 2, ■ ■ ■ , Vt, and define the mapping P(m) : 
M->{1,2,--- ,T} so that W m &V P{m) ,VmeM. 

We use a scalar linear achievable scheme. In particular, we choose n = L + 1 and S = ¥ q , where q is 
chosen to be sufficiently large so that there exist T vectors Vi, V2, • • • , Vy, in the L + 1 dimensional vector 
space over ¥ q , such that every L + 1 of them are linearly independent. These are the linear coding vectors 
along which the aligned messages from each partition will be sent. 

Destination r, which desires messages W r = {W ri , W r2 , ■ • • , W rL }, receives 

M 

S n = $>iV P(i) 

i=l 

i=l i:Wie^l r \i:W,^ r UW r / 

Side information 
at destination r 

where the last equation follows because all elements of {i : Wi $ A r U W r } belong to the same alignment 
subset by definition; this subset is denoted by Vt- After cancelling the second term above, destination r 
obtains a linear combination of L + 1 vectors, Vp/ ri ), Vp( r2 ), . . . ,Y P f rL -\ and V-p t . We need to show that 
these L + 1 vectors are linearly independent if the feasibility condition is satisfied. The feasibility condition 
implies that for a feasible system, if two messages belong to the same alignment subset, then whenever one 
of them is desired, the other must be available as an antidote. Since this is not true for any two of the L + l 
vectors Vp( ri ), Vp( r2 ), . . . ,Vp( ri ) and Vp t , they must each belong to distinct partitions. Since every L + l 
coding vectors are linearly independent, we conclude that resolvable at destination r as 

required. 



6.2 Outer Bounds 

We start with a simple information theoretic outer bound on the index coding problem. 

Theorem 8 The achievable rates (Ri,R2,...,Rm) for the index coding problem where |Wfe| = L, V k e 
{1,2,..., K}, satisfy the following inequalities: 

Ri + Yl ^ : ' Vfc e /C (75) 

{i:WidW k } {i:W i ew J n(w fc u.4 fc ) c } 

In other words, the sum of the rates of the messages desired at a destination and the rates of the interfer- 
ing messages (i.e., undesired messages that are not available as antidotes to D^) that are intended for any 
other destination Dj cannot exceed 1 . 

Proof: Consider a reliable achievable coding scheme for the network. Assume all the messages except Wk 
and Wj H (Wk U Ak) c are given by a genie to destination fc and destination j. Further, assume that the genie 
provides Wk to destination j as well. Now, using the reliable achievable scheme, destination fc can decode 
Wk- Having decoded Wk, in this genie aided index coding problem, destination fc has all the information 
that destination j has. Therefore, destination fc can decode Wj n (Wk U Akf as well. Since destination i can 
decode messages W k and Wj n (Wk U Akf, the sum of H(W k ) and H(Wj n (Wk U A k ) c ) must be bounded 
above by the entropy of the bottleneck link H(S n ). 

Remark: A reader familiar with Carlieal's outer bound for the interference channel ||32ll may notice sim- 
ilarities with the approach in the above proof. Also note that the bound, as with all bounds in this section, 
is information theoretic, so it applies to all linear and non-linear coding schemes. 



As an example, suppose we have an index coding problem with M = 4, K = 2 and L = 2, where Wi = 
{Wi,H / 2 },H ; 2 = {Wa,Wi},Ai = 0,A% = {Wi,Wu} as shown in Fig, [TT] Using the above outer bound, we 
have i?i + i?2 + i?3 + -R4 < 1, which is tight because it is also achievable by sending each message separately 
in 4 consecutive time slots. 




Figure 11: Index coding problem with M — 4, K = 2 and L = 2 where R\ + i?2 + R3 + Ri < 1 

Next we derive outer bounds which are valid for general index coding problems and are tight for the 
-jJpY rate feasibility problem. The main result of this section is the following. 

Theorem 9 For any N + 1 distinct indices io, i±, %n € Ai and N indices j 1 , ...,j N E /C, if 

,1 ,2 -3 N-l JV 

m U W n h W l2 h... J ^ W iN _, U W lN and W lQ $ A k where W lN e W k for k e K, (76) 

then 

Ri + R j} l + R n + ... + R 3 n l + Ri N < N (77) 

The converse for Theorem|3]follows from the above theorem. This converse is expressed next as a corollary 
to the above theorem. 

Corollary 2 The rate tuple TZ with Ri = R 2 = ... = Rm = j^i is not achievable in a single bottleneck network 
where \Wk\ = L, V k e {1,2,..., K} if and only if there exist distinct indices i,j<EM such that Wi, Wj belong to 
the same alignment subset and Wj <E Wk and Wi £ Ak for k e K, 

Proof. If Wi , Wj belong to the same alignment subset, there must exist a chain of alignment relations 
connecting W ir Wj as 

• 1 -2 -3 -N — l -N 

Wthw^hw^h W lN _ x ^W 3 (78) 

where N is the length of the chain. If Wj € Wfc and Wi £ Ak for k € K, from the result of Theorem 5 shown 
later in this paper we have an explicit rate bound 

Ri + R,.i + Ri, + i?,2 + ... + R tN _ 1 + R jN + Rj < N (79) 

Clearly, Ri = Ri, = R i2 = . . . = Ri N _ 1 = Rj = Rji = Rji = . . . = RjN = does not satisfy the above 
bound. This completes the proof of the corollary. 



6.2.1 Proof of Theorem^] 



Theorem[9]affords a simple proof for N = 1. We therefore begin with this case. 



If N = 1, we have Wi A Wj , Wj €W r ,Wi<£A r for r € £, then we intend to show that Ri + R 3 + R kuL < 



1. 



Consider any reliable index coding scheme. Now, we form a genie-enhanced index coding problem 
(where the scheme continues to remain reliable) as follows. Let Wi € W m . In this enhanced problem, 
assume that destinations k, r, m are given all the messages except messages W i j^ 1 M---,k L by a genie. We 
also assume that Wk u k 2 ,...,k L are provided via the genie to destinations r,m. Since Wj € W r , for a reliable 
scheme, destination r can decode Wj . In this enhanced problem, either m = r or destination m is a degraded 
version of destination r. In both cases, destination r can decode Wi as well. Now consider destination k. 
Achievability implies that destination k can decode Wk x ,k 2 ,---,kL- Now, destination k has all the information 
available at destination r and can therefore deocode Wi,Wj as well. Since all messages Wi j,ki,k 2 ,— kt are 
decoded at a single destination, we can follow steps similar to the proof of Theorem |8j to the conclusion 
that R4 + Rj + R kl:L < 1. 



4, K = 3 and L = 2 where Wi 
= W2 as shown in Fig 



12 We 



As an example, suppose we have an index coding problem with M = 
{W 1} W 3 },m = {W 2 ,W 3 },W 3 = {W a ,Wi},Ai = {W 2 ,W i },A 2 = 0,A 3 
have the following alignment chain W\ ■<-> W4, W4 € W3, Wi ^ ^3. Using the above outer bound, we have 
Ri + R2 + R3 + Ri < 1/ which is tight because it is also achievable by sending each message separately in 4 
consecutive time slots. 



— 



W? — . 



Wo 



w. — 




Figure 12: Index coding problem with M = 4, X = 3 and L = 2 where i?i + i? 2 + i? 3 + R 4 < 1 

Now, we consider the case where N > 1. Without loss of generality, we assume that 5 = {0, 1} in 
the proof. We first present the proof explicitly for N = 2 and = 3. These cases do not afford a simple 
explanation as above, and capture all the ideas required for proving the theorem for an arbitrary value of 
N. We begin with N = 2. In what follows we show that for any 3 distinct indices i, j, k E M. and indices 

l,m G K if W t A Wj & W k , W k e Wp, WJ £ ^ p for p G K then i?, + ^ +R k + R h , L + R muL < 2 - 



H(W il);2 ,..., ; J = - /W,!, i^.WaIIoW (80) 

< /W lli2 it ;5",^ il)l2i ... ii J + o(r i ) (81) 

= /W 11 « ai ... 1 « £ ;5 n |W7 iJ . IliIa) ... il J + o(n) (82) 
= ^(5" I ^ 1)i2 ,...,,J 

-ff^l^ + ^n) (83) 

< n-H(S n I W?j) + o(n) (84) 



Similarly, 



H(W mumat ... tmL ) = nR milL < n~H{S n \Wl k ) + o{n) 
For destination p that is interested in W k , we have 



nR k 

■ 

< 
< 
< 



I(W k ;S n 1 A p )+o(n) 
I(W k ;S n ,W^ k )+o(n) 
I(W k ;S n \W? k ) + o(n) 
H(S n | W? k ) - H(S n | W?) 



•o(n) 



fr(5" | W£,.) + 

| Wf) - nRi 

H(s n | 



nRj — 7iRi + o(n) 



o(n) 

\ W tk) 
h o(n) 

I w. 



< 
< 



n(l-R h:L ) 
2 — Ri — Rj 



•n(l 

^1:1 



1? 



"11:1, 

Rmi 



) — ni?; — nRj + o(n) 



(85) 



(86) 
(87) 
(88) 
(89) 
(90) 

(91) 

(92) 
(93) 
(94) 



wher e ||9"T] | follows from Lemma |2kproved later in this section). Inequality | [93) follows from substituting 
from <|84) and | [85) into (92 I, and | |94| is obtained by dividing by n and taking the limit asn->oo. 

As an example, suppose we have an index coding problem with M = 5, K = 5 and L — 2 where Wi = 

{Wi,W 5 },W 2 - {Wi, W 2 },W 3 = {W 2 , W 5 },W 4 = {W2,W 4 },W 5 - {W 2 , W 3 },A! = {W 2 },A 2 = {W3M3 = 
{Wi,W 4 },.4 4 = {Wi,W 3 ,W 5 },A = {Wi,W 4 ,W 5 } as shown in Fig. [13] We have the following alignment 

chain W3 O W 4 «-» W5, W5 € W3, W3 ^ .A3. Using the above outerbound, we have R 3 + R\ + R 5 + i? 4 + 
R 1 +R 2 + R 5 < 2. 



(v lf (v 5 




Figure 13: Index coding problem with M = 5, K = 5 and L = 2 where i? 3 + i?i + i? 5 + R 4 + J?i + J? 2 + #5 < 2 

We next consider the case where the length of the alignment chain is N = 3. Suppose we have an align- 
ment chain of length 3, i.e., Wi A Wj A Wk & Wi and the ends of the chain interfere, i.e., Wi G W m , Wi ^ 
„4 m , then our goal is to show that Ri + R Pl L + Rj + R qiL + Rk + Rr 1:L + Ri <3- 

We start with the inequalities at the destinations p, q, r , each of which is shown exactly through the steps 
followed in |80)-@. 

H{W PuP2 ,..., PL ) = nR Pl:L <n-H(S n \Wf tj ) + o(n) (95) 
H(W quq2 _ qL )=nR qi:L <n-H(S n \Wl k ) + o(n) (96) 



nR riL < n 



H(W rur2 _ rL ) 
For destination m that is interested in Wi, we have 

nRi = 



H{S n \W c k ,) + o{n) 



< 



< 
< 

< 
< 



I(W i; S n ,A m )+o(n) 
I{W l -S n 1 Wl l ) + o{n) 
I{W V ,S n \ Wf tl )+o(n) 
H(S n \W? l )-H(S n \W?) + o(n) 
H(S n | W^)-nRi + o(n) 
H(S n \Wl J )+H{S n \Wl l ) 
-H(S n | Wf) - nR t + o{n) 
n(l - R PlL ) + H(S n | Wfr) - nRj - nRi 
n(l-R Pl:L ) + H(S n \W? k ) 
+H{S n \W c kd )-H{S n \WZ) 



-nRj — nRi 



< n(l - R p 



■ o(n) 
n(l- 



R qx . L ) + n(l- Rr 



o(n) 



nRk — nRj 



-nRi + o{n) 



R, 



< 



3 Rpi.L ^Ql.L ^Tl:L 



(97) 



(98) 
(99) 
(100) 
(101) 
(102) 

(103) 
(104) 



(105) 

(106) 
(107) 



Note that Lemma [2] is applied twice, first in arriving at < 103} and then to obtain [ 105) . 

Now we prove Theorem [9] for an arbitrary N > 2. Suppose we have an alignment chain of length TV, 

Ak where W lN € WV- for k e JC, then our goal 



W io h Wi, h W i3 h 

is to show that R it 



W iN _, U W lN and W l0 



R 



n-.L 



Ri 



Rj" r + Ri N < N. We start with the inequalities at the destinations 



3 



j , each of which is shown exactly through the steps followed in 1 80 l-l 84 1. 



) = nR^ <n- H(S r ' 



Wf 0til )+o(n) 



nR jlr <n-H(S n \W^ 2 ) + o(n) 



For destination k that is interested in Wi N , we have 



nR^N < n 

Jl:L 



H(S n | Wl 



+ o(n) 



nRi 



I(W lN ;S n ,A k 



< 

< 
< 

< 
< 

< 



I(W lN ;S n ,W^ tN 



o(n) 
) + o(n) 

H(S n | W, 

H(S n \W^ lN )~nR la +o(n) 
H(S n \Wf 0til )+H(S n \W? uiN ) 
-H(S n | Wn - nR l0 + o(n) 



Wl M )+o(n) 
lo , lN )-H(S n \W? o )+o(n) 
nRi, 



R, 



n(l «^ 
n(l-R j} J 
—nRi 1 — nR. 
n(l - Rji J + n(l 
-nR io + o(n) 



H{S n \W9^ N )-nR l 



%x ,in ) ' "~* 

H(S n \Wf ui2 )+H(S> 
+ o{n) 



- nRi + o(n) 



lllN )-H{S n \WQ 
R j1l ) + H(S n | Wf 2>iN ) - nR l2 - nR n 



(108) 
(109) 

(110) 
(111) 



(112) 
(113) 
(114) 
(115) 
(116) 

(117) 
(118) 

(119) 
(120) 



< n(l - R j1l )+ n(l - %J + ... + n(l - i^-,) + | W^ iN _ ± ) + H(S n \ W? N _ uiN ) 
-H(S n | W? N _ x ) - nR lN _ 2 + ...- nR n - nR io (121) 

< ntl-R,! ) + n(l-R i 2 ) + ... + n(l - R.N-2) + nil - R 4 n-A + n(l - R,n ) 

— V Jl:L' V Ji-.l' V 3 H:L J\:L' 

-nR lN _ 1 - nR lN _ 2 + ... - nR i± - nR ia (122) 
=> Ri N < N - Rji l - Rf iL - ■■■ - RjN L - Ri N -! - Ri N - 2 - ■ ■ ■ - Ri (123) 



Note that Lemma|2]is applied in \\17\ , 1 119 1 and | |121) . This proves Theorem|9] 



Finally, we prove Lemma|2]which is used repeatedly in the proof of outer bounds. 
Lemma 2 

H(S n | W? k ) < H(S n | W? d ) + H(S n | Wi k ) - H(S n \ W c t ) (124) 

Proof: Since entropy function is a submodular function, for any two subsets of random variables C, T>, we 
have the following 

H(C) + H(V)>H(CUV) + H(CnV) (125) 

If we choose subsets C, V as C = {S n , W?A, V = {S n , W? k } and use the submodular property of entropy 
function, we have, 

H(S n ,W z c j) + H(S n , W? k ) > H(S n , Wn + H(S n ,W° j<k ) (126) 

which is equivalent to 

H(S n | W? d ) + H(S n | W° k ) > H{S n | W?) + H(S n \ W^. k ) 

> H(S n \W[) + H(S n \Wl k ) (127) 



where 1 127| is true because conditioning reduces entropy. This completes the proof 



7 Proof of Theorem [5] 

7.1 Achievability 

First, note that i£A = U + D = K— 1, then it is obvious that each source can send 1 symbol per time slot and 
achieve a rate of 1. If A = K — 2, then, it is easy to verify that a rate of \ is achievable using Theorempl Here 
we show achievability for A < K — 3. Using linear coding, we show that each user can send L = (U + 1) 
symbols inan = K — A + 2U dimensional space F™, when q is sufficiently large. Note that because U < D, 
we have A — 2U = D — U >Q, and therefore, n < K. To begin the construction of our achievable scheme, 
we pick K vectors over F™ such that any n of them are linearly independent. We denote the vectors by 
z 1; Z2, . . . ,Zk. Note that over a sufficiently large field, the vectors z i5 i = 1, 2, . . . , K can be chosen to satisfy 
this propert}|^] Our construction for the n x (U + 1) matrix Vj — whose columns are the beamforming 
vectors for the U + 1 symbols comprising message Wi — is as follows. 

V, = [Zi Zj+l z i+2 ■ . ■ z l+u ] 

where all subscripts are interpreted modulo K. Note that any two adjacent messages overlap in U dimen- 
sions. For example, the signal spaces spanned by Vj and V i+1 overlap in dimensions z i+1 ,z i+2 , ■ ■ ■ ,z i+ fj. 
This is the basis for interference alignment. With this construction, we intend to show that (|5j is satisfied. 

4 One approach to such a construction is to use Vandermonde matrices. 



First, note that the U + 1 columns of V; are linearly independent since £7 + 1 < n. In particular, denoting 

Xj = [xi y o Xi,i . . . , Xi t u], we have 

K U 



1=1 j=0 

Now, because of the symmetric nature of the problem, we only need to show that W\ is linearly resolvable 
at D\. This ensures resolvability at all other destination nodes. Note that destination 1 has side information 
of Xij for all i E {K — U + 1,K — U + 2, . . . , K, 2, 3, . . . , D + 1}. The precoding vectors are known to everyone 
apriori. Cancelling the effect of the known symbols, destination 1 obtains 

U K-U U 



3=0 

Desired Signal 



i=D+2 j=0 



Aligned 
Interference 



Thus, destination 1 sees (£7 + 1) + (K — (D + 1)) = n signal vectors zi,z 2 , . . . ,Zu+1/Zd+2/ z d+3, ■ ■ ■, 
Zfc. Since these n vectors are linearly independent by design, the desired scalars X\ t j,j = 0,1,..., U are 
resolvable at destination 1. By symmetry, the same conclusion is applicable at all the K destinations. 

To understand the role of alignment, note that there are (K — A — 1) undesired messages that interfere 
at destination 1, each occupying a (U + 1) dimensional signal space. However, because any two adjacent 
interferers overlap in U dimensions, the (K — A — 1) interferers collectively occupy only U + 1 + (K — A — 
1) — 1 = K — D — 1 dimensions. Since the desired signal occupies £7 + 1 dimensions, and (£7 + 1) + (K — 
D — 1) = n, the observed signal space is big enough to resolve desired signals from the interference. See 
Figure 14 for an example. 





(b) 



Figure 14: Multiple unicast with neighboring antidotes where (D,U,K) = (2,1,8). (a) Desired message and antidote 
sets, (b) Capacity optimal solution - 2 symbols per message are sent with one-to-one pairwise alignments and a rate of 
| per message is achieved using the scheme pictorially depicted. z\ , . . . , z$ may be chosen to be the 7 columns of the 
7x7 identity matrix and the all l's vector. 



7.2 Outer bound 

The trivial bound, that a message can not be transmitted at more than rate 1 (the capacity of the bottleneck 
link) is tight when A = K — 1, i.e., all undesired messages are available as antidotes. If A = K — 2, i.e., only 
one message is missing from the antidote set, then the rate \ per user is the outerbound, as shown in the 
proof of Theorem[8] Therefore we focus on the setting A < K — 3. 

We first present an outer bound on the rates achievable via linear achievable schemes to gain intuition. 
Later, we will use this intuition to get information theoretic outer bounds for any achievable scheme. 



7.2.1 Dimension Counting Outerbound 

Our goal is to show that the sum-capacity per message is bounded above by K ^f[+2U ' ^° snow this, we 
first prove that if we want to achieve a rate d per message, then the total dimension of interference at each 
destination satisfies 



dim(interference) > 



K-A + 2U 

uTi 



K — A — 1 is the total number of interferers at each destination. Suppose Vj denotes the subspace assigned 



to user i. Hence, our goal is bounding Yli=i dim(Vi U Vj+i U . . . U Vi+k-a-i), i-e., the total number of 
interfering dimensions at all the destinations. We define <x, as follows 

K 

a 3 = dim(Vi UV, +1 U...U V i+j -t) (129) 

i=l 

So our goal is bounding the total number of interference at all destinations, i.e., ctK-A-l- 
Lemma 3 For the following m and j 



m= V u+i ^ j = {K-A-\) mod(Z7 + l) ifK-A-1 mod ([7+1)^0 
" 'fi-1, j = U + l ifK-A-1 mod (U + 1) = 



m ■■ 



K-A-l 



(130) 



u+ 



we have 

ax-A-i > max + otj (131) 

Proof 

&K-A-1 = a m {jj + i) + j 
K 

= 5^dim(ViUV i+ iU...UV m (p + i )+J -_ 1 ) (132) 

8=1 

K 

> ^{dim(Vj U Vi+i U . . . U V i+i _i U Vj +J -_ 1+ (c/ + i) U Vi+j_i +2 (£/+i) U . . . U Vi +3 -_i +ln (i7 + i$#3) 
i=i 
if 

= 5Z{dioi(Vi U V i+ i U . . . U V J+i _i) + dim(Vi+j _i + (£/ + i)) + dim(V l+j _ 1+2([/+1) ) + . . . 

i=i 

+ dim(V i+i _i +m ( C / + i))} (134) 
= mat + oij (135) 



where ( |134| ) is true becasue Wi,..., i+;; -_i,i +: ,-_i + ( C /+i),...,i+j-i+i(!7+i) i A+j-i+i{u+i) for / = 0,1,..., m. This 
proves Lemma [3] 

Lemma 4 For = 2, 3, ...,U + 1, we have 

Oi\ 

a j > jj—^ai (137) 

Proof. 

K 

dim(Vi U V i+ i U . . . U V< +i _ 2 U Vi+j+tr-x) (138) 

i=l 



if if 



< dim ( V * U Vi+i U . . . U V i+j - 2 U V i+:i -i) + J2 dim(V J+ i U V 4+2 U . . . U V i+i _i U V i+j+U -i) 

i=l i=l 
if 

dim (^+i U V 4+2 U . . . U V <+i _i) (139) 

z=l 

if if 

< ^ dim(Vi U V i+ i U . . . U V i+j _ 2 U V i+ j_i) + ^ dim(V i+ i U V 4+2 U . . . U V 4+i _i U V i+i ) 

z=l i=l 

if K 

+ dim(V J+2 U V l+3 U . . . U V l+3 U Vi +j+£ /_i) - ^ dim(V l +2 U Vi + 3 U . . . U Vi+j) 

i=l i=l 

if 

- ]T dim(V i+ i UV 1+2 U...U V i+j _ x ) (140) 



if if 



< ^ dim(Vi U V i+ i U . . . U V 4+i _ 2 U V i+ j_i) + J2 dim (H+i U V 4+2 U . . . U V 4+i _i U V i+j ) 
i=i i=i 

if if 
+ dim(V i+2 U V 4+3 U . . . U V l+j U V J+i+ i) + . . . + ^ dim(V,+ U U Vi+JZ+l U . . . U Vi+j+U-2 U Vi+_y+!7-l) 

z=l i=l 

if if 

- ^ dim{V t+u U V i+C7+ i U . . . U V 4+J+c/ _ 2 ) - . . . - dim (H +2 U V l+3 U . . . U V i+i ) 

i=l i=l 
if 

-^dim(V 4+1 UV, +2 U...U Vi+j_i) (141) 



where at each step, we use the submodular property of dim function. On the other hand, since {Wi , Wi+i , . . . , W{. 
•Ai+j+u-i, we have 

dim(V< U Vi+i U . . . U V J+i _ 2 U V i+j+U -i) = dim(Vi UV W U...U V i+j - 2 ) + dim(V i+j +u-i) (142) 
By combining | |141| and | |142| , we have 

ctj-i + O-x < dj + CKj + . . . + ay — Q!j_i — ... — Qy-i (143) 
<*j> jj^i{{U + 1)^-1 +«i} (144) 



This proves < 136) . Solving this recursive equation, we get [ 137) . This proves Lemma |4] 



Combining Lemma|3]and Lemma[4j we have the following 

U + j m(U + l)+j + U K-A-l + U 
a K -A-x > max + ^ + - «i = + 1 ai = U + 1 Ul 

Then the dimension of desired message plus the dimension of interference at all destinations should be less 
than or equal to the total available dimensions, i.e., K 

^{dim(V 4 _ D _i) + dim(V l U Vi+i U . . . U V l+K -A-2)} < ai + K ~u + \ + L ^ ^ K ( 146 ) 

2—1 

Hence the optimal symmetric rate per message is d = ~^ol\ < k ^a+ 2 u 



7.2.2 Information Theoretic Outerbound 

We define atj as follows 

K 



aj =Y J H{S n \Wl %+1 _ l+] _ l ) (147) 



A 
i=l 

Our first goal is to bound cxk-A-i- We proceed as follows. 
Lemma 5 For the following m and j 

m = l^u^l, 3 = (K-A-1) mod([/ + l) ifK-A-1 mod (£7 + 1)^0 
m= K --^- 1 -1, j = U + 1 jfif-A-1 mod(f/ + l) = 1 ; 

a/c_/i_i > mai + ctj + o(n) (149) 

Proof: Note that destination can decode W i+j - 1+ (u+i) from (5™, W? i+1 , i+j _ 1+(!7+1 )) 

with P e — > as n — > oo. Therefore, we can write 

ra.R i+j _ 1+([/+1) = i?(W i+i _i + ([/ + i)) (150) 

= I(W i+ j-i + (u+i)} S"\ w li+i,...,i+j~i,i+j-i+(u+i)) 

+ H (Wi +j -i+(u+i)\S n ,W^ i+1 ^ l+j _ li+j _ 1+(u+1) ) (151) 

-fl'(s n |WiVi,...,<+i-i) + ( n ) ( 152 ) 

which gives us 

^(^|^ i+ i,..., i+J -- M+J --i+(c/+i)) = (153) 

Next, note that destination D i+ j_ 1+2 (u+i) does not have W i: ..,^ +J _ 1 .j +J _ 1+ ( C / +1 ) ji+J -_ 1+2 ([/+i) as antidote. 
So, given (S n , W[_ i+j _ 1i+j _ 1+{u+1) i+j _ 1+2{u+1) ) it must be able to reliably decode W i+j _ 1+2(u+1) . 

n Ri+]-i+2(u+i) = H(W, l+j -i +2 {u+i)) (154) 
= I(Wi +J -i+2(u+i)': S n ,W£ ^ i+J _ l l+J _ 1+ ( U+1 ^ l+j _ 1+2 ( U+1 }) 

j-i+2{u+i)\S n ,W^^ i+J _ li+J _ 1+(u+1 ^ i+j _ 1+2 ^ u+1 )) (155) 

= / (^+J-l+2(L'+l);^™|M / ^...,i+ : ,-_l : i +: ,-_l + (;7+l), l +i-l+2((7+l)) +0(n) 

= H(S n \W?^ ii+ j_ l!i+ j_ 1+ ( U+1 ^ i+ j_ 1+2 ( U+1 )) 

-H{S n \Wl^_ u+j _ 1+{u+x) ) + o(n) (156) 

which along with [ 153} gives us 

^(^ n |^...,j+j-l,i+3-l+(C/+l),i+j-l+2(C/+l)) = "-^+j-l+2(C/+l) + ni?i +J -_i + ((7+l) 

+H{S n \Wl l+1 _ l+J _ l ) + o{n) (157) 

Similarly, we note that destination Z? i+3 _ 1+ ;([/ +1 ) for 3 < I < m does not have W i! ... ) j + j_i )i+ j_ 1+ ( [ / +1 ) i+j-i+z([/+i) 

as antidotes, so it must be able to decode W i+ j_ x+J (£/ +1 ) from (S n , W£. , i+J _ l i+:j _ 1+(u+lh ^ l+:j _ 1+l{u+1) ). 

n R%+j-i+m(u+i) = H(Wi + j-i +m (u+i)) (158) 
= /(Wi4.j-_i +m ((/ + i);jS n , W / ! ^... i i +J -i, ? : +: ,-i + (!7 + i),... il+: ,'-i +m ([/ + i)) + 

) (159) 



- i(w i+ j- 1+m (u+iy, s n \w? ii+J -_ 1+ (i 7+ i) ) . .., l+J -i+ m (u+i)) + °( n ) ( 16 °) 

= H(S n \W^^ i+ j_ li+ j_ 1+ ( U+1 ^^ i+ j_ 1+m ( U+1 j) — 

H ( Sn \ W i,.-,i+3-i,i+]-i + (U+l),...,i+j-l+("i-l)(U+l)) + °( n ) ( 161 ) 



which gives us 

H (S n Wt...,i+j~l,t+j-l + (U+l),...,i+]-l+m,(U+l)) = ^i+J-l+mCCI+l) + • • • + n-Rj + j_i +2 (£/+l) 

+ fl"(5 n | W7; i+li .. . ti+J _ x ) + o(n) 

(162) 



Our goal is to bound a^^j. We have 

oa-a-i = a TO (j/+i)+j (163) 

A 

= *£H(S n \W? i+1 _ i+K _ A _ 2 ) (164) 



i=l 
K 



) (165) 



i=l 
K 



= Y^i nR i+i-i+(U+i) + ■■■+ nR l+J -i+ m(u +i) + H(S n \W? i+1 ^ i+j _ x ) + o(n)} (166) 

i=l 

= mai + aj + o(n) (167) 



where 1 165} is true because conditioning reduces the entropy. 1 166} is derived by replacing 1 162} into < 165} . 



This proves Lemma [5] 
Lemma 6 For j = 2, 3, . . . , U + l,we have 



a, > aj -i + r^-j+o(n) (168) 



Proof: We use Lemma [7] (proved later in this section) at each step. So we have 

A A 
22, H ( Sn -> W i > i+l,...,i+j-2 > i+j+U-x) ^ ?A H (' gW ' W i,i+1 



i=l 

+H(S", Wi + i !i+ 2,..,,i+j-i,i+j+u-i) 
-H(S n ,W- +hl+2 _ l+J ^)} (170) 

A' 

< }j//i.v".u;,. : ,., 

s=i 

+H(S n , Wi + i !i+2 ,..,,i+j-i,i+j) 

+H(S n , Wf + 2,i+3,...,i+j-l,i+j,i+j+U-l) 
-H(S n , W j C +2 ,i+3,...,i+i) 

-ff^",^,^...^..!)} (171) 



A" 



i=l 



+H(S n , W^ +lii+2 ,...,i+j-i,i+j) 

+ . . . + H(S n , W[ +U i+U+1 ^ i+ j +u _ 1 ) 

—H(S n , Wi +U i+u+1 ^ j + j +u _ 2 ) 

— ... — H(S n , Wf+2,i+3,...,i+j) — 

^^ W v,hh)} ( 172 ) 

We note that destination D i+ j + u-\ does not have Wi t i+x,...,i+j-2,i+j+u-i as antidotes, so it must be able to 
decode W i+j+U -i horn (S n , W? i+1) .„ ii+i _ aii+j+ p_ 1 ). 

ni?i +i+C /_i = (173) 
= J(Wi+j+i7-i;5 n , W^ i+1 i ...,i+j_2,i+j+t/— 1) + 

ff^^r,^,,...^^^.,) (174) 
= /(W i ;5"|W^ i+1) „. ii+ . J ._ a , i+J+l ,_ 1 ) + (n) (175) 
= ff(5 n |W^ i+li ... ii+i _ a>i+J . +t ,_ 1 ) -ff(5 n |W,P i+li ... ii+i _ 2 ) + o(n) (176) 

which gives us 

ff(^l^m,..,<+i-2,<+i+tf-i) = nR i+j+U -i + H{S n \Wl i+lt ... ti+j _ 2 ) + o{n) (177) 
Replacing ^177) into l |172) , we have 



OLj-l + OL\ + o(fl) < OLj + OLj + . . . + OLj — OLj-\ — ... — CXj-l (178) 

a j >jj^ l {{U + l)a j - l +a l } + o{n) (179) 



Solving this recursive equation, we get 

^> |±y«i+o(n) (180) 

This proves Lemma [6] 

Combining Lemma|5]and Lemma|6j we have the following 

U + j . , m(U + l)+j + U . , K-A-l + U . . , 101 . 
a^_A-i > m«i + ^ + j- cti + o{n) = JJ + 1 + = U + 1 Ul + 

Finally, we note that destination D i _£ ) _ 1 does not have any of the messages Wi ) i+i ) ...^+x-A-2 as antidotes. 
So it must be able to decode Wi-D-i from (S n , Wi-D-\,i,i+i,...,i+K-A-2)- 

K K 



^nlk-D-i < ^l(W i -n-i\S n \W?_ D _ 1<i>i+l! .., ti+K _ A _ 2 )+o(n) (182) 
= ]T i H (S n \W?-D-i,i,i+i,...,i+K-A-2) ~ H(S n \Wl l+1 _ l+K _ A _ 2 )} + o(n) 



i=X 
K 



^ + d83) 



17 + 1 

Rearranging terms and applying the limit n — > oo we have 

< x (184) 

■i— 1 

so that we have the information theoretic capacity outer bound of K Y^2U P er messa g e - 



Lemma 7 For I = 0, 1, . . . , U — 1, we have 

^(^ n \^i+l,i+l+l,...,i+l+j-2A+j+U-l) — H(S n \W^ +li+ i +1 ^ i+l+ j_ 1 ) 

+H(S n \Wi + i + i^i + i + 2,...,i+l+j-l,i+j + U-l) 

-H(S n \Wr +l+1<i+l+2 ,... A+l+j _ 1 ) (185) 

Proof: Setting 

C = {S n , W? +i 

V = {S n , W i c +/+l i+;+2j ... : i+i+j-i,i+j+u-i} 
and using the submodular property of entropy function 

H(C) + H(V)>H(CUV) + H(Cr\V) (186) 

we have 

H(S n , Wi + i^„ ji+ i + j_i) + H(S n , W, c +/+1] s+j+u-i) ^ H(S n , W£ +l+l i+l+2: ...A+i+j-i) 
Equivalently 

H(S n \Wf + i i-i + H(S n \W£ +l+1 ^ i+l+ j_ lti+ j +u _ 1 ) > H{S n \W![ +l+l i+l+ 2...^i + i + j-i) 

+H{S n \Wi + i^ + i + 2,... : i+i+j-i,i+j+u-i) 
> H(S n \Wf +l+li+l+ 2 : ...,i + i+j-i) 

+H{S n \W![ +li+l+ 2^.^i + i + j-2,i+j+u-i) 

(187) 

where | |187| is true because conditioning reduces the entropy. Equivalently 

H(S n \W£ +li+l+1 ^^ ,.i+l+j-2A+j+U-l) — ^K^ n \^i+l,i+l+l,...,i+l+j-l) 

+H(S n \Wf +l+1 ,i+i+2,...,i+i+j-i,i+j+u-i) 
-H(5 w |W^ I+liifJ+ai ... ii+I+: ,._ 1 ) (188) 

This proves Lemma [7] 

8 Proof of Theorem [6[ Neighboring interference 
8.1 Achievability 

We use a scalar linear achievable scheme. In particular, we choose S = F^ +1 . Suppose we define Vi, V2, . . . ,Vd+i 
as D + 1 linearly independent vectors in a D + 1 dimensional space. Then, starting from any arbitrar- 
ily chosen message, we assign these vectors to D + 1 consecutive messages respectively and repeat this 
assignment periodically. So if we assign V TO to message W r , then message Wi will be encoded using 
V(j_ r + m ) mo d (D+i)- We now need to show the resolvability of message W r at destination r. Suppose V m 
is the vector assigned to message W r . Notice that there are U + D interfering messages at this destination: 
W r -u.r-u+i,....r-i.r+i.r+2....,r+D- The remaining messages are available as antidotes and can be cancelled. 
The U interfering messages W r -u, ■ ■ ■ , W r -i are encoded over V ( m -u) mod (d+i>- • • , v (m-i) mod (.d+i), 
respectively. Since {i + D + 1 mod D + 1} = {i mod D + 1}, these interfering vectors are the same as 
V( m+D +i-u) mod (d+i)/- • ■ , V(m+D) mod (D+i)- Also the remaining D interfering messages of destination r, 
i.e., W r +i, W r +D are sent over V (m+1) mod ( D +i), ■ ■ • , V (m+Z3) mod {D+l) , respectively. As we can see all 
the interfering messages align in the Z?-dimensional space spanned by vectors V( m+1 ) mod (d+i) > • • • > V(m+ d) mod (D+l) ■ 
This interference space is linearly independent of V m because of our construction. Since we are operating in 
a D + 1 dimensional space, W r is linearly resolvable at destination r as required. This proves achievability. 

Remark: V 1; V 2 , . . . ,Vd+i can be chosen to be columns of D + 1 dimensional identity matrix and the 
field can be chosen to be ¥2 ■ In other words, a D + 1-symbol long achievable scheme is obtained by simply 
sending the bit ... © x r - D ^i © x r © x r+ n+i © . . . , in the rth channel use, where © denotes the XOR. 



8.2 Outerbound 



For the converse, note that we can set U = 0. This is because, if U > 0, a genie can provide messages 
W r -u,r-U+l,...,r-i to destination r as antidotes to make an enhanced index coding problem where each 
user is only missing D + 1 antidotes after the desired message, where we will show that the capacity is jppi 
per message. In this setting where U = 0, we intend to show that 

R% + R i+i + • ■ • + R i+D < 1- 

To do so, we give destinations i, i + 1, . . . , i + D messages W? i+1 ___ i+D through a genie. Now, in this genie- 
aided system, note that destination i, which is missing antidotes Wi+i,i+2,.. .,«+£> can decode W. t (because 
of achiev ability in the original index coding problem). Having decoded W%, this destination has all the 
messages present at destination i + 1, and can therefore decode W; L +\. Having decoded Wi t i+i, destination i 
is now equipped with all messages present as antidotes at destination i + 2 and can therefore decode W i+2 . 
Continuing this argument, it can be shown that destination i can decode Wi.i+i,....i+D, arid therefore we 
have, as required, 

R% + R i+i + • ■ • + R i+D < 1- 



9 Proof of Theorem [7| X network setting with local connectivity 
9.1 Achievability 

We use a scalar linear achievable scheme. In particular, we choose S = ¥ q and n = L ( L 2 +1 ) . We choose 

Vi, V2, . • • ,V i([+i) to be L< - L 2 +1 - > linearly independent vectors in a n = £( - L 2 +1 - ) dimensional space. Each 

message is sent over one of these vectors. 

Consider an arbitrary destination, say destination k. After removing antidotes, this destination receives 



a linear combination of L messages shown in 1 189 1 above the line, and it is interested in L messages 



Wfc — {W A feL,fcL+L-l,(fc+l)L+L-2,--- ,{k+i)L+L-i-X,— ,{k+L-2)L+l\ 



shown in blue in 1 189 1. The destination faces (L — L) interfering messages shown in 1 189 1 with colors other 
than blue (and above the line). The "blue" messages are desired by destination k and are encoded using 
L linearly independent vectors, Vi, . . . , ~Vl- Because the total number of dimensions is L ^ L ^ rl \ the L 2 — L 
interfering messages should align such that they span at most L( - L 9 +1 - > — L dimensions. Among these inter- 
fering messages, the L—l "red" messages W(i-+i)L,(k+i)L+L-l) (fc+L-2)+2 are desired by destination k + 1 

and should be linearly independent. These messages in red are encoded using V^+i , . . . , "Vsl-i. Also all the 
L — 2 "green" messages that are below the red messages and above the line, i.e., W(k+2)L,(k+2)L+L-2,...,{k+L-2)- 
are not available as antidotes at either destination k or destination k + 1 and therefore seen as interference 
at both these destinations. This implies that the red messages and blue messages can not align with these 
green messages. So we assign V 2 l, • • • , V 3 ^_ 2 as encoding vectors respectively to green messages. Proceed- 
ing thus, we assign V L ^ L+1 y 2 to ^(fc+z-ilz- So far, we have assigned L(L + l)/2 — L linearly independent 
vectors to L(L + l)/2 — L interferers at destination k. 



W(k-1)L+1 W / (fc-l)i+2 

WfcL+1 W kL+2 
W(k+1)L+1 W r (fc+l)L+2 



W / (fc+L-3)L+l W / (fe+L-3)L+2 
W / (fc+L-2)L+l W( k+L ^ 2 )L+2 



W r (fc+i-l)i+l W / (fe+L-l)L+2 



w, 



(k-l)L+L-l 



w kl 



w, 



(k+l)L 

W^(fc+1)L+L-1 W(k+2)L 



(189) 



(k+L-3)L+L-l 
{k+L-2)L+L-l 



w, 



(k+L-2)L 



W / (fc+i-l)i+i-l W( k+L ) L 



So the remaining interfering messages that are shown above the blue messages at ( 189 1 should be sent 
over the vectors such that they stay in the same span of interfering messages below the blue messages. 
The way that we satisfy this constraint is by sending the messages shown in [ 189) respectively over the 
following vectors 



V2L V 2 l+i 
V3L-1 V3L 

Vi V 2L -i 



V a 



V 2£ -l 

v 2 

V L+2 



v L+1 

V 2L 



(190) 



V -Lf-L + l) 
_2_ 



Vl- 



Evidently at destination k, all desired messages are seen over Vi, . . . , and all the interfering mes- 
sages are seen over Vl+i, . . . , Vl+l+i) . Therefore, desired messages are resolvable at destination k. How- 

2 

ever our goal is showing that all the destinations are able to decode their desired messages. If we assign 
Vi,V 2 ,...,V L _i,V L to W( k +L-i)L+i,W( k +L-i)L+2, ■ ■ ■ ,W(k+L-i)L+L-i,W(k+L)L respectively and repeat 
the pattern shown in 1 190) as assigning vectors periodically for the remaining messages, we can show re- 
solvability at every destination. This follows because, if we choose any L consecutive rows (circularly) of 
1 190 1, the vectors assigned to anti-diagonal messages which are desired messages are linearly independent 
from each other and from the vectors assigned to interfering messages. This proves achievability. 

Remark: Vi, V2, ■ • • , "V hl±i) can be chosen to be columns of L ^ L 2 +1 - ) dimensional identity matrix, e.g., 
over F2 and therefore can be an orthogonal scheme. For the corresponding CBIA problem this means that 
channel coherence is not required [EJ. 



9.2 Outerbound 

To prove the outerbound, consider a set of 

Wo = {W k L,kL+L-l:kL+L,-- 



L(L+1) 



messages 



,(k+i)L—i—l:(k+i)L+L,— ,(fc+L-2)L+l:(fc+L-l)L 



}■ 



i.e., the set of "colored" messages shown (above the line) in 1 189) . Among the messages in Wo, the number 
of messages intended for destination k + i is L — i, where i € {0, 1, 2, . . . ,L — 1.} Our goal is to argue that 
destination k can decode all these L ^ L 2 +1 ' ) messages and hence symmetric rate per message is bounded as 

C ^ L(L+1)- 

Consider any reliable achievable index coding scheme. Assume a genie provides all the messages except 
Wo for all the destinations k, k + 1, • • • , k + L — 1. With the considered index coding scheme, destination fc, 
which has Wo° as antidotes, can decode the "blue messages" in 1 189) , i.e., 



{W k 



kL,kL+L-l,(k+l)L+L-2,--- ,(k+i)L+L-i-l,--- ,(fc+L-2)L+l 



}■ 



destination k + 1 can decode its desired L — l "red" messages - {W k L+L,(k+x)L+L-i,- ,(k+i)L+L-i,- ,(fe+i-2)L+2} 
- using Wo c U {W^l} as antidote. This automatically implies that destination k, having decoded all the blue 
messages including WkL can decode the red messages as well. Now, having decoded all the blue messages 
and red messages, the set of messages known to destination k includes Wo° U WkL,kL+L-i ,(fc+i)i " tne an ~ 
tidote at destination fc + 2. Therefore all the green messages can be decoded at destination k. Proceeding 
further similarly we can argue that destination k can decode all the messages in Wo ■ This completes the 
proof. 



10 Conclusion 



As evident from this work, interference alignment is integral to the index coding problem. The interfer- 
ence alignment perspective allows us to not only solve fairly complex index coding problems, but also it 
makes the intuition behind the capacity optimal solutions quite transparent. As with wireless networks, 
while much of the initial intuition from interference alignment schemes is based on dimension counting 
based on linear codes, with few exceptions the dimension counting bounds are readily translated into tight 
information theoretic bounds. It is also remarkable that the interference alignment perspective allows us 
to prove the insufficiency of linear codes for multiple unicast index coding. As a side remark, we note 
that the capacity results of the index coding settings explored in this work, since they rely only on vector 
linear achievable schemes that readily translate into the field of complex numbers, directly establish corre- 
sponding DoF results for the cellular blind interference alignment settings as well. We end this paper with 
a couple of intriguing questions. 

First, it is not clear that auxiliary messages and destinations, while convenient for our purpose, are nec- 
essary in the equivalent multiple unicast problem. Since the purpose of auxiliary messages and destinations 
is only to force the expanded messages into alignment, the natural question is if auxiliary messages and 
destinations are not included, would it be possible to achieve a higher min rate for the expanded messages 
in the multiple unicast setting, presumably through a non-aligned solution? While this possibility seems 
unlikely, we do not yet have a proof that it is impossible. Such a proof would be desirable because it would 
make the equivalence between groupcast and unicast settings more direct. 

The second question pertains to the capacity of the X channel setting with finite number of users. We 
showed in Section 4.3.3 that the capacity solution for the case where the number of users is infinity, is 
L( - L 2 +1 ^ per message, where L is the number of messages per transmitter or per receiver. However, note 
that the finite user setting for the X channel, studied in Example 3 presented in Section 3.3 is also consistent 
with this result. In that example, we have 3 messages per source/destination, i.e., L = 3 and we achieve 
rate ^ per message, even though the number of users is finite. A similar observation can be made for the 
L = 2 setting considered in 1 5 ] . This suggests an interesting possibility - is it always possible to achieve rate 
L ( L 2 +1 ) per message with finite number of users K? Note that the outer bound applies to finite K settings as 
well. On the other hand, even if the optimal rate per message is the same for finite K as well as infinite K, 
evidently the alignment solution can be much more complex for finite K settings. Note that a sophisticated 
subspace alignment solution is needed in Section [33) but much simpler orthogonal solutions suffice when 
K is infinity, as shown in Section [9 

Finally, we conclude with the observation that the index coding problem remains still an open problem 
of great interest, and we expect that the insights from interference alignment will continue to be useful 
not only in solving smaller networks or symmetric versions of extended networks as shown here, but 
perhaps also in other directions not explored in this work — e.g., designing interference alignment inspired 
algorithms for arbitrary index coding settings, and studying order optimality of interference alignment 
techniques in index coding settings modeled as random graphs. 
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